Summary: An Interview with Nvidia CEO Jensen Huang About AI's iPhone Moment – Stratechery by Ben Thompson (stratechery.com)
9,437 words · HTML page
One Line
Nvidia CEO Jensen Huang discusses the prospect of inference becoming the primary way software is operated, Nvidia's interest in developing consumer AI GPUs and smaller language models that could run on cell phones, and how diversity, redundancy, data centers, and cloud services figure into the company's new business model.
Key Points
- ChatGPT and AI's "iPhone moment" have increased demand for generative AI models, leading to an acceleration in demand for training and inference.
- Nvidia is responding to the demand by working on large language models and inference platforms, but faces constraints in meeting demand due to factors such as chip production, data center availability, and customer demand.
- Nvidia is shifting towards delivering its system as a whole and working closely with cloud providers to optimize performance for customers, launching with Oracle as its first cloud partner, treating clouds as an OEM-like layer.
Summaries
194 word summary
Nvidia CEO Jensen Huang discusses the prospect of inference becoming the primary way software is operated, with generative AI eventually available on every computer. Nvidia is interested in developing consumer AI GPUs and smaller, more performant versions of large language models that could run on cell phones within 10 years. The interview highlights the company's commitment to working with industries to use its NeMo and Picasso models with their own data. Nvidia's new business model involves working directly with customers to accelerate their end-to-end MLOps platforms and host their applications in the cloud. Huang emphasizes the importance of diversity and redundancy in building resilience for large companies, and of investing in fabs and interconnects for scaling up computing systems. He also points to data centers as the future of computing and to Nvidia's move into cloud services. The impact of ChatGPT and AI's "iPhone moment" on Nvidia's business is discussed: an acceleration in demand for training and inference has left Nvidia constrained by factors such as chip production, data center availability, and customer demand. Huang emphasizes the importance of being strict about inventory and purchase order obligations.
481 word summary
Nvidia CEO Jensen Huang discusses the impact of ChatGPT and AI's "iPhone moment" on Nvidia's business in an interview conducted in March 2023. ChatGPT is an easy-to-use application with incredible generative AI capabilities that caused reverberations in every industry, waking it up to the potential of generative AI. Demand for generative AI models has increased as they are integrated into large applications such as Microsoft Office and Google Docs, and there is an urgency to train larger models and create supporting models for fine-tuning, alignment, guardrailing, and augmentation. This has led to an acceleration in demand for training and inference, and Nvidia is responding by working on large language models and inference platforms. The construction of AI supercomputers involves thousands of components and is a heavy process, and the company faces constraints in meeting demand due to factors such as chip production, data center availability, and customer demand. Nvidia had to take write-downs on gaming and data center products after a disappointing year in sales, and Huang emphasizes the importance of being strict about inventory and purchase order obligations.
Huang also discusses the company's ability to quickly change course and develop new platforms for different use cases in the inference business. He emphasizes the importance of diversity and redundancy in building resilience for large companies, and of investing in fabs and interconnects for scaling up computing systems. Huang highlights data centers as the future of computing and Nvidia's move into cloud services. The company is shifting towards delivering its system as a whole and working closely with cloud providers to optimize performance for customers, launching with Oracle as its first cloud partner, treating clouds as an OEM-like layer.
The interview highlights the company's commitment to working with industries to use its NeMo and Picasso models with their own data. Huang emphasizes that Nvidia wants to work with everyone, including AWS. The interview also discusses Nvidia's new business model of working directly with customers to accelerate their end-to-end MLOps platforms and host their applications in the cloud. Nvidia can help accelerate end-to-end processes and refine platforms for both established companies and newcomers to the field. Huang recommends cloud services like SageMaker or Azure ML for those new to machine learning or AI, unless they require a bespoke framework platform. Finally, Huang discusses Nvidia's focus on solving problems that only it can solve, rather than competing with others on things that everyone can do.
Huang also discusses the future potential of inference becoming the primary way software is operated, with generative AI eventually available on every computer. Inference will be done both on local devices and in the cloud, and Nvidia is interested in developing consumer AI GPUs and smaller, more performant versions of large language models that could run on cell phones within 10 years.
1660 word summary
Nvidia CEO Jensen Huang discusses the advancement of Moore's Law and the potential for inference to become the primary way software is operated in the future. He predicts that generative AI will eventually be available on every computer and that inference will be done both on local devices and in the cloud. Huang also mentions Nvidia's interest in developing consumer AI GPUs and the potential for smaller, more performant versions of large language models to run on cell phones within 10 years. On the question of centralized versus localized compute for AI, Huang explains that Nvidia focuses on solving problems that only it can solve, rather than competing with others on things that everyone can do. Nvidia celebrates the success of its partners, whether they use CUDA or not, and prioritizes making it easy and cost-effective for them to achieve their goals. Huang also addresses concerns about commoditization and efforts by companies like Meta to expand PyTorch. He discusses the benefits of working with Nvidia for applications that require high-quality generative AI video-based storytelling; for other applications, such as spell checkers, he recommends cloud services that are already GPU-accelerated. Huang believes most companies should go directly to the cloud and work with their partners to ensure infrastructure and services are accelerated, but for those who need experts to develop acceleration layers or algorithms, working directly with Nvidia is beneficial.
He also suggests cloud services like SageMaker or Azure ML for those new to machine learning or AI, unless they require a bespoke framework platform, and notes that the explosion of innovation in AI may occur in a fundamentally different layer of the stack going forward, further up on top of it. The interview covers Nvidia's role in assisting companies with accelerated computing and AI. There are two groups of customers: established companies that Nvidia has long been assisting and newcomers to the field. Nvidia can help accelerate end-to-end processes and refine platforms. It works with large platforms like Amazon, Microsoft, and Google's Vertex AI, but also assists smaller companies without large engineering teams, and is well positioned to help companies that have only recently realized the need for AI and generative AI. Huang discusses the company's new business model of working directly with customers to accelerate their end-to-end MLOps platforms and host their applications in the cloud; because DGX Cloud puts Nvidia in the browser, the company can engage directly with customers. Huang also notes that the company already works directly with end users and developers in industries such as healthcare, automotive, and video games. Fulfillment of the system has historically been through someone else, but now Nvidia can fulfill it directly or through another CSP or OEM. The new business model gives Nvidia an opportunity to generate subscription revenue instead of relying solely on product sales. Huang also emphasizes the company's commitment to supporting industries in using its NeMo and Picasso models with their own data, and says that Nvidia wants to work with everyone, including HP, Dell, Lenovo, and AWS. While AWS was notably absent from Nvidia's recent joint press release, Huang believes AWS has an interest in continuing to partner with Nvidia deeply.
The interview also touches on the technical aspects of DGX Cloud and how Nvidia will intermediate relationships with host providers for its customers. Huang discusses the collaboration between Nvidia and cloud providers to optimize performance for customers: the architecture is not diverse enough for there to be a noticeable difference between cloud providers, though there may be slight differences in experience. Nvidia works closely with each cloud provider to integrate its software architecture into the provider's system. Oracle is a good example of a cloud provider that will build the full Nvidia stack, while AWS has its own competitive Nitro layer; Nvidia is launching with Oracle as its first cloud partner, treating clouds as an OEM-like layer. Huang describes the company's shift towards delivering its system as a whole, rather than piecemeal through different clouds. This is the company's largest business model extension and involves a large and growing service organization to help people with their models. Nvidia works closely with cloud service providers (CSPs) and their salesforces and marketing to offer fully optimized computers compatible with its software stack. The goal is to extend Nvidia's architecture to all CSPs and provide the same software stack on any cloud, multi-cloud, hybrid cloud, or at the edge. Nvidia integrates its system into various companies and works with them to understand their needs and APIs, which allows Nvidia to be a vertically integrated systems company while still connecting with the world. Huang discusses the importance of data centers as the future of computing and how Nvidia has built a vertically integrated system that operates from the cloud to the edge. He emphasizes the need for a software-defined system that can orchestrate the entire fleet of computers inside data centers as if it were one, with a separation of the compute plane and the control plane.
Huang also discusses Nvidia's move into cloud services, such as DGX Cloud and Omniverse Cloud, which were pre-announced in previous interviews. Transparency is important to Nvidia's partners who depend on them, and their goal is to build a computing platform that's available everywhere. The CEO of Nvidia, Jensen Huang, discussed the importance of diversity and redundancy in building resilience for large companies. This includes investing in fabs (fabrication plants) in the United States and elsewhere, which can be more expensive but necessary for supply chain resilience. Huang also talked about the importance of interconnects for scaling up computing systems and the trade-off between speed and effectiveness. He also addressed the limitations of fused chips in complying with export controls, but assured that they still serve the needs of customers and run the same software. The interview with Nvidia CEO Jensen Huang discusses the company's ability to quickly change course, such as taking A100s and changing them to A800s for China, which was possible due to the speed of the chip being fine and the limitation being the memory and interconnect speed. The core technology for inference products at scale for data centers exists, and four new platforms have been developed for different use cases, including large language models and video editing at full film quality scale with generative AI. The scale of inference business has gone through a step function, and Nvidia is racing to meet demand while also focusing on generative AI work done in the cloud. The interview also touches on changes in the way Nvidia thinks about the business post-ChatGPT. Nvidia CEO, Jensen Huang, discusses the challenges of building AI supercomputers, including the need for switches, NICs, cables, and data center space. The construction of these supercomputers involves thousands of components and is a heavy process. 
The company faces constraints in meeting demand due to factors such as chip production, data center availability, and customer demand, and had to take write-downs on gaming and data center products after a disappointing year in sales. Huang emphasizes the importance of being strict about inventory and purchase order obligations. Demand for generative AI models has increased due to their integration into large applications such as Microsoft Office and Google Docs, and there is an urgency to train larger models and create supporting models for fine-tuning, alignment, guardrailing, and augmentation. This has led to an acceleration in demand for training and inference, and Nvidia is responding by working on large language models and inference platforms. The ChatGPT model is a significant development that allows programming with natural language and represents a phase shift in computing; companies are now considering the implications for their industries, competition, products, and business models. Huang discusses the three properties of computing platforms and how they apply to AI. ChatGPT and generative AI have driven an inflection point in AI adoption and created a new computing model, and the accessibility of this new computer and the applications that can be built with it are brand new. Understanding the language of proteins and chemicals can lead to new opportunities for companies. ChatGPT, AI's "iPhone moment", has opened people's minds to the possibilities of AI: executives who were not previously engaged with AI are now interested in it, and large language models will learn the language of everything that has structure, including the physical world. Huang was surprised by the effectiveness and widespread use of ChatGPT, an easy-to-use application with incredible generative AI capabilities that caused reverberations in every industry and woke it up to the potential of generative AI.
Within 60 days, hundreds of startups were created and VCs were funding them. ChatGPT is unquestionably the easiest-to-use application of its kind, performing tasks that consistently surprise just about everyone; its impact has dramatically changed views on the potential of generative AI. In this interview, conducted in March 2023 on the occasion of this week's GTC conference, Nvidia CEO Jensen Huang discusses the impact of ChatGPT and what he calls AI's iPhone moment on Nvidia's business. He also touches on the announcement of Nvidia's new DGX Cloud service, how Nvidia responded to the Biden administration's export controls, TSMC's new plant in Arizona, running AI locally, and Nvidia's position in the stack in an LLM world. The interview is lightly edited for clarity. The frequency of Nvidia's semiannual conferences might seem aggressive, but given Nvidia's central role in AI, Huang notes that their last talk seems like it was years ago.