Welcome back to Data Driven, the podcast where we talk about how data
Speaker:and AI are changing the world. And sometimes we
Speaker:even understand it. Today's guest is the brilliant Carmen
Speaker:Lee, CEO of Silicon Data and former Bloomberg brainiac
Speaker:who's now on a mission to bring financial grade transparency to the wild west
Speaker:of GPU compute markets. If you've ever wondered how to hedge
Speaker:your AI infrastructure costs the way airlines hedge fuel, or what
Speaker:a futures market for GPUs even looks like, you're in for a
Speaker:treat. Carmen's turning raw compute into a tradable
Speaker:commodity, normalizing chaos, and possibly building the
Speaker:Bloomberg terminal for AI infrastructure, minus the beige
Speaker:keyboard. We cover everything from tokenomics and TSMC
Speaker:to why your AI startup's margins are flatter than the earth in a
Speaker:conspiracy forum. Oh, and there's a used GPU car
Speaker:lot somewhere in Virginia. Stick around. This one's a data
Speaker:geek's fever dream in the best way.
Speaker:Hello and welcome back to Data Driven, the podcast where we explore the
Speaker:emerging field of data science, artificial intelligence, and
Speaker:this crazy AI world we live in. But it's all underpinned by data
Speaker:engineering. And with me, as always, is my favoritest data
Speaker:engineer in the world. Even my dog is barking, giving you a shout out.
Speaker:Andy Leonard. How's it going, Andy? It's going well, Frank. How are you?
Speaker:I'm doing well, I'm doing well. I'm keeping busy.
Speaker:We were talking about other podcasts that we have and
Speaker:the other one is Impact Quantum. So go to impactquantum.com
Speaker:and definitely check it out. And we had a very fascinating
Speaker:conversation with our guest in the virtual green room. So without
Speaker:further ado, let's welcome Carmen Lee to the show. She's
Speaker:the CEO of Silicon Data and she is driven by a
Speaker:passion for developing and delivering cutting
Speaker:edge derivative products and data solutions that
Speaker:provide essential data, intelligence and efficiency to compute
Speaker:markets worldwide. Her company's vision is to
Speaker:revolutionize these markets through unparalleled data transparency
Speaker:and financial innovation. Welcome to the show, Carmen.
Speaker:Thank you. You delivered my tagline so well I might want to
Speaker:hire you to do the... whatever. Thank you.
Speaker:Thank you. I was looking the other day, and this is almost our
Speaker:400th show. So I do have a face for radio,
Speaker:apparently. But thankfully, a voice for radio too. So good for me.
Speaker:This is great. And speaking of radio, we were geeking out because
Speaker:I started my career in New York in finance
Speaker:and Bloomberg. Having a Bloomberg terminal on your desk was
Speaker:a status symbol. There were the ones who had it and the ones who didn't
Speaker:and the ones who wanted it. And you know radio,
Speaker:right? Bloomberg radio, which we also get here in dc. And you used to work
Speaker:for Bloomberg, so that's really cool. That's right. I had a great time
Speaker:working for Bloomberg. My team was part of the
Speaker:data team. I thought Bloomberg was
Speaker:one of the most cutting-edge data companies, especially in the
Speaker:financial services industry. Back then I covered all content,
Speaker:all product data integrations with any third-
Speaker:party ecosystems. So think about any trading
Speaker:cycles from front, mid, and back offices, think about any
Speaker:cloud providers and database systems, and
Speaker:even AI, LLMs, whatever you call them,
Speaker:different use cases, real-time data,
Speaker:reference data, static data, anything. It's
Speaker:really fascinating. I learned a lot. My background before that, I
Speaker:was all in financial services, and I don't know if I'll bore your audience at
Speaker:this point. I started my career in trading, high-frequency trading,
Speaker:in Chicago. So to me, transparency,
Speaker:efficiency, and free markets are sort of in my blood.
Speaker:100% brainwashed at this point in life. So one of
Speaker:the things I noticed when I was at Bloomberg is there were a
Speaker:lot of interesting ecosystem
Speaker:platforms that came up last year, right? They're all leveraging gen
Speaker:AI. They were among the first adopters, which is good for them,
Speaker:and their client base sometimes can be financial institutions. So, Bloomberg's
Speaker:client base. So one of the things I noticed, and it was a really fascinating conversation.
Speaker:So those startups, they're gaining a lot of traction. Good for them. So
Speaker:obviously I was like, oh, you're doing so well. And they would complain to me,
Speaker:saying that they were SaaS, right? They were 100% SaaS,
Speaker:revenue is static, and then they're pivoting to
Speaker:AI-driven SaaS. So their cost: think about last year, the
Speaker:price per GPU per hour was like $9, or
Speaker:6, 7, 9, back down to like $3 if
Speaker:you run interruptible instances, right? So the swing is like
Speaker:300% within the same day, but their revenue
Speaker:is static, right? So their margin goes from
Speaker:positive 40% to negative 60% and back to
Speaker:positive, and there's no way for them to manage it. And at the
Speaker:same time, it's not like they're bringing on more clients so they
Speaker:can enjoy the scalability. It's, again, the
Speaker:same thing: the margin is uncontrollable, and they have this problem of how
Speaker:do they actually come up with a cash flow plan for next year. And then
Speaker:they obviously complain. It totally struck me:
Speaker:hey, this industry needs a financial
Speaker:infrastructure layer, right? It's almost like talking to American
Speaker:Airlines. Say, hey, airline, you cannot hedge your fuel price
Speaker:fluctuations. How are they going to price their tickets? They can't, right?
Speaker:And it's not like American Airlines goes to OPEC and says, give me a five-year-long
Speaker:contract. They don't do that. For every single one of those commodities,
Speaker:price discovery and hedging happens in derivatives markets. So
Speaker:futures, options. Because there's a few reasons, right? Number one, it's just
Speaker:efficient. Number two, it's cheap. Number three, it's flexible. And then you and me,
Speaker:we can do the same thing. If we have oil exposure, we don't have to be
Speaker:American Airlines. But today, if you are one of the
Speaker:hyperscalers, you can go to those, you know,
Speaker:whoever, right? Whoever produces chips, right? And get a long-term contract.
Speaker:But if you and me start a neocloud, guess what? We don't have access
Speaker:to that kind of pricing. It's not good. You have a
Speaker:few players who have the pricing, who have a way to hedge it. But
Speaker:then the smaller players just couldn't get in the game, right? It's really not good
Speaker:for the ecosystem's health, performance, and
Speaker:risk management. So that really struck a chord with me.
Speaker:Last year I was like, man, someone needs to do the
Speaker:index, the pricing, the benchmarking layer for
Speaker:GPU compute, because compute, I feel,
Speaker:will be the biggest resource in the next few years,
Speaker:surpassing all energy combined, right? So that's why I left Bloomberg right
Speaker:away. Super passionate. I think we can bring so much transparency to the
Speaker:ecosystem, and it will benefit everybody, right? Not only benefiting the people who
Speaker:need compute, but benefiting, you know, the end consumers. Because think about
Speaker:the whole funnel, right? You have to finance the GPUs, the
Speaker:actual cluster costs, right? So
Speaker:if the banks don't have enough information or hedging tools, then
Speaker:they have to charge you high interest; they have no other way. Or
Speaker:you have to look for alternative capital, which is traditionally
Speaker:more expensive, right? Because they're not banks. Banks are cheap as a
Speaker:cost of capital, right? So then the cost from, you
Speaker:know, stage zero is high. Then think about the second stage, third
Speaker:stage, and then for people like you and me using Sora with OpenAI,
Speaker:everything will be more expensive because of that, right? So fixing the problem
Speaker:with transparency from the get-go is really, really
Speaker:critical, and then the benchmarking, and encouraging
Speaker:secondary markets and all that flexibility, and then
Speaker:availability, will be really incredible, benefiting the whole
Speaker:ecosystem. Interesting. So is it fair to say you've built basically a
Speaker:futures market for GPU compute? I'm building a
Speaker:benchmark index layer. We are working with futures exchanges,
Speaker:right? So I'm not a futures exchange. Think about
Speaker:S&P, right? They license the index to an exchange. Right,
Speaker:right, right, right, right. That's what we do, right? We will license our index
Speaker:to an exchange, and they will have futures, options on top of that, and other
Speaker:financial products. That's a fascinating concept because, like,
Speaker:you're right, we need that because the scarcity
Speaker:of GPU compute is a real issue. It comes up.
Speaker:And if, if. If Amazon. The rate of volatility, how do
Speaker:you deal with that? With, like, 40,
Speaker:60% daily fluctuation, it's
Speaker:just not a very transparent market,
Speaker:which breeds inefficiency. Right.
Speaker:Absolutely. So for those of. Oh, sorry. Go ahead,
Speaker:Andy. Okay. I was just going to ask: so are you tracking
Speaker:features and functionality and all of that? That would be how you value
Speaker:the GPU itself, and you compare that to the price and
Speaker:come up with some ratio. Exactly. So
Speaker:compute is, unfortunately, not as easy as electricity,
Speaker:or even oil, which has different grades, right? So even an H100
Speaker:has different configurations, right? It's not all the same, right?
Speaker:Different CPUs, different RAM, and geolocation matters,
Speaker:right? So a lot of things. So normalization becomes a very critical
Speaker:component of a financially settled index.
Speaker:Right now we have H100 and A100 indexes published on Bloomberg and Refinitiv.
Speaker:So the way we do it is we have a base case, and all
Speaker:the factors normalize to the base case. And the way we normalize is from
Speaker:historical data: what factors are actually important to the users?
Speaker:Does the CPU matter? How much does it matter? What's the weight, whatever it
Speaker:contributes? How often do we calibrate? Maybe it matters
Speaker:today; maybe tomorrow this particular
Speaker:input is valued more, right? So we do calibration,
Speaker:periodic calibration, as well.
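The base-case normalization described here can be sketched roughly as follows. The base configuration, the factor weights, the regional adjustments, and the prices are all invented for illustration; they are not Silicon Data's actual factors or methodology, which the guest does not disclose.

```python
# Sketch of base-case normalization: each observed $/GPU-hour quote is
# adjusted toward a reference ("base case") configuration using per-factor
# weights, so quotes from different setups become comparable.

# Hypothetical base-case configuration the index settles against.
BASE_CASE = {"cpu_cores": 96, "ram_gb": 1024, "region": "us-east"}

# Illustrative calibrated weights: how much a relative deviation in each
# factor moves the fair price, plus a flat regional premium/discount.
FACTOR_WEIGHTS = {"cpu_cores": 0.10, "ram_gb": 0.15}
REGION_ADJ = {"us-east": 1.00, "eu-west": 1.03, "apac": 1.08}

def normalize_price(observed_price, config):
    """Map an observed $/GPU-hour quote onto the base-case configuration."""
    adj = 1.0
    for factor, weight in FACTOR_WEIGHTS.items():
        deviation = config[factor] / BASE_CASE[factor] - 1.0
        adj *= 1.0 + weight * deviation   # simple linear factor model
    adj *= REGION_ADJ[config["region"]]
    return observed_price / adj           # strip the configuration premium

# A richer-than-base quote gets discounted back to base-case terms.
quote = {"cpu_cores": 128, "ram_gb": 2048, "region": "eu-west"}
print(round(normalize_price(2.85, quote), 4))
```

Periodic recalibration, as described above, would amount to re-fitting `FACTOR_WEIGHTS` against fresh historical data.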
Speaker:Interesting. Yeah, it's fascinating to see, because I mean,
Speaker:it always seemed like there's something missing around
Speaker:the GPU market, right? And I also think
Speaker:it's been a while since we had any kind of compute limitations on what we
Speaker:wanted to do, right? Like with CPUs, it's cheap
Speaker:and you can get what you want, and it's not supply and demand kind of shifting.
Speaker:Yeah, I agree. Right. So I didn't really think of,
Speaker:you know, this market kind of response to
Speaker:it, which I think is an interesting approach, and I think
Speaker:it's fascinating. Yeah. Even think about an
Speaker:AI SaaS company, right? I don't know if you've heard the saying that
Speaker:SaaS is 80% margin, AI SaaS is 0% margin.
Speaker:So, I mean, it depends on how you run your workflow. If
Speaker:you are not being thoughtful, right,
Speaker:you just dump everything you need to do into the most
Speaker:expensive closed-source model, and you're not
Speaker:optimizing your thinking tokens, your input tokens,
Speaker:output tokens, it can get very pricey very
Speaker:quickly, right? You're not batching it, you're not doing all the right things.
Speaker:And even if you do all the right things, it's gonna be such a meaningful
Speaker:percentage of your cost. And all those companies are not ready for it, right?
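To make the cost swing concrete, here is a toy token-cost model. The per-million-token prices, token counts, and batch discount below are entirely made up for illustration, not real vendor pricing.

```python
# Toy illustration of the token-cost point above: the same request volume
# priced through an expensive, unoptimized path versus a tuned one
# (shorter prompts, cheaper model, batched calls). All numbers invented.

def monthly_token_cost(requests, in_tokens, out_tokens,
                       price_in_per_m, price_out_per_m, batch_discount=0.0):
    """Monthly spend in dollars for a fixed request volume."""
    cost = requests * (in_tokens * price_in_per_m +
                       out_tokens * price_out_per_m) / 1_000_000
    return cost * (1.0 - batch_discount)

# Same 1M-request workload, two strategies (hypothetical figures).
naive = monthly_token_cost(1_000_000, 2_000, 800, 15.0, 60.0)
tuned = monthly_token_cost(1_000_000, 600, 400, 1.0, 4.0, batch_discount=0.5)

print(f"naive: ${naive:,.0f}/month, tuned: ${tuned:,.0f}/month")
```

Even with invented prices, the orders-of-magnitude gap shows why a cost line that used to be "basically electricity" suddenly dominates the margin.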
Speaker:Because before, what was the raw material cost?
Speaker:Electricity. Like, really nothing, right? But now
Speaker:every company becomes, you know, an AI company, which is great,
Speaker:but then their cost structure is shifting from zero cost
Speaker:to 40%, 60%, some percent, tied to tokens
Speaker:or to GPUs at the end, right? Right. So how do you think about hedging
Speaker:that kind of cost component? Can you control that? Can you optimize for it? Can
Speaker:you monitor it? Can you benchmark it? You know, can you hedge it?
Speaker:No, that's a good point. So do you think
Speaker:there are multiple, I guess, inputs and levers to this, right? Because it doesn't seem
Speaker:like this would be a straightforward thing. So, you know, Andy mentioned that you were
Speaker:tracking certain benchmarks. Like, what benchmarks are you tracking? Because I'm very curious about
Speaker:this. Right? So there's a few things; it depends on
Speaker:your position, and this can change
Speaker:every single day. Like, our ecosystem is so nuts, right? So it
Speaker:depends on your
Speaker:positioning in the whole workflow, right? Think about it: if you are a neo
Speaker:cloud, you are selling tokens, right?
Speaker:The cost for you is the GPU, right? So then your margin
Speaker:becomes the diff between the token revenue and the GPU cost, and that's
Speaker:the way we calculate it, right? Which is different units.
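The unit mismatch being described, revenue per token against cost per GPU-hour, can be bridged by throughput, as in this sketch. The throughput, token price, and GPU-hour prices are hypothetical, chosen only to echo the intraday swing mentioned earlier.

```python
# Toy neocloud margin model: revenue is earned per token served, cost is
# paid per GPU-hour, so tokens-per-GPU-hour bridges the two units.
# All figures are hypothetical.

def margin_per_gpu_hour(tokens_per_gpu_hour, price_per_m_tokens,
                        gpu_cost_per_hour):
    revenue = tokens_per_gpu_hour * price_per_m_tokens / 1_000_000
    return revenue - gpu_cost_per_hour

# Same serving stack while the spot GPU price swings within the day.
for gpu_cost in (9.0, 6.0, 3.0):   # $/GPU-hour
    m = margin_per_gpu_hour(4_000_000, 2.0, gpu_cost)
    print(f"GPU at ${gpu_cost}/hr -> margin ${m:+.2f}/GPU-hour")
```

With fixed token revenue, the margin flips sign purely on the GPU leg, which is the exposure a hedge would target.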
Speaker:And then your worry is, okay, for
Speaker:token serving, how much money can I get
Speaker:from one particular GPU? The FLOPs, right? How can I optimize for that? And
Speaker:what if I'm even hosting open-source models?
Speaker:How do I make sure people are using that open-source model? Should I
Speaker:be shifting it? What's the pricing for that? Think about that strategy. And on the GPU
Speaker:side: okay, am I renting GPUs, or am I outright
Speaker:purchasing those GPUs, putting them on my books, and depreciating them? How
Speaker:long can I depreciate them for? Let's say
Speaker:everyone wants the latest and greatest and I'm selling the GPU after the second, third
Speaker:year: what's the terminal value of the GPUs? Who should validate that? Which bank should
Speaker:depreciate those asset classes? So there's a lot coming at the neo
Speaker:cloud space. If you think about your inferencing infrastructure,
Speaker:right? So let's say you're an
Speaker:AI tech company, right? Then your revenue is tokens,
Speaker:right? Ideally they're paying you based on token
Speaker:use cases as well. And then your cost is tokens, which is
Speaker:easier. But at the same time, you're thinking through, okay, so
Speaker:right now for open-source tokens, the prices
Speaker:do move up and down. For example,
Speaker:if you look at DeepSeek, even DeepSeek, they host their own serving, but
Speaker:then the price changes; they have off-peak hours, and that changes all the
Speaker:time. Or you can do closed source, where the price is pretty
Speaker:static. The way I think about it is, again, an extremely
Speaker:free-market approach, right? How can we
Speaker:make sure, especially for the open-source ones, the token prices
Speaker:are driven by the market demand and supply curve,
Speaker:right? Let's say I have like 100 GPUs
Speaker:right now, and let's say I
Speaker:choose to host only one Llama open-source
Speaker:model, and then I know I can produce X amount of tokens,
Speaker:both input and output tokens, right? And I can just auction them off
Speaker:to you guys, and Andy can buy a million tokens, and one day he's like,
Speaker:I'm not going to use them, why don't I sell them to Frank? Can there
Speaker:be some market for that? Because right now you are stuck
Speaker:with it, right? In
Speaker:my mind, unfortunately, I'm very brainwashed toward free markets. I feel like you have to
Speaker:give people options. The more options you give people, and
Speaker:Andy has flexibility, Frank has flexibility, the more people are
Speaker:willing to participate, because they know they can get out. Because right now, if you're
Speaker:stuck with hyperscaler GPUs or any tokens, you're stuck with them,
Speaker:and then you're less likely to commit, because you know you
Speaker:can't get out, or you get fined, even worse, right? You know those cases: you
Speaker:get fined millions of dollars when you back out of cloud deals.
Speaker:That's one of the things I really want to encourage: people thinking about tokens
Speaker:and GPUs as a main cost structure. How can we drive
Speaker:efficiency so people can commit, and then get out if
Speaker:they need to, and then swap out, and everyone gets more value
Speaker:and efficiency from those transactions? So is it
Speaker:more like an exchange or an auction?
Speaker:What's the mechanism? Right. So on the token and GPU side,
Speaker:obviously there are spot exchanges already, like Compute
Speaker:Exchange, where you can actually tell them, hey, I need this
Speaker:configuration, this many nodes. And then they will
Speaker:say, okay, let's do an auction. And then the
Speaker:best price, best quality, whatever combination, wins, right?
Speaker:Yeah. You can potentially do other asset classes as well, right? So,
Speaker:Silicon Data is a data company. So think about us as the Bloomberg, and there are the
Speaker:NYSEs, NASDAQs, and everybody, right? On the spot side,
Speaker:you can actually get GPUs, like you can actually get stocks from those exchanges.
Speaker:And the last piece is, we collect data from those exchanges, like
Speaker:Bloomberg, right? And then we produce financial products on top of that, right?
Speaker:So that's right: there's spot, which is the
Speaker:NASDAQ, right? You can buy and sell, get actual physical
Speaker:delivery of all the compute or tokens you need. And there's the
Speaker:data side, which is us being the Bloomberg, right? And then the last piece
Speaker:is, structurally, the financial products layer on the data layer. And
Speaker:then we're agnostic, meaning we're agnostic of chips,
Speaker:agnostic of spot markets, agnostic of everything, right?
Speaker:And there's a futures exchange which licenses
Speaker:our indexes to create futures products. Ideally we're
Speaker:settling to spot. Maybe some of them will settle at spot, right? So it's pretty
Speaker:standard practice. So
Speaker:would the currency or the coin of the realm be tokens,
Speaker:or compute time, or compute seconds? Things
Speaker:change. It's making my life really fun
Speaker:and, you know, also different. Yeah, all the time.
Speaker:And then you, you mentioned you have this quantum thing, right? Right.
Speaker:It's a lot. We track all compute. So it doesn't matter to us
Speaker:what chips and what architecture framework;
Speaker:you know, we don't really care. We benchmark the performance and the data
Speaker:inside. And for everything we don't know, for us it's about getting
Speaker:ready for everything. So we want to create products
Speaker:that are actually going to be helpful to the marketplaces, not just creating
Speaker:things like a gambling table where people bet on binary things, right? For
Speaker:us, it's how can we make it useful for the people who are actually
Speaker:naturally long compute? So the neoclouds and everybody else,
Speaker:they need a product to hedge their revenue fluctuation, right?
Speaker:So they issue short futures. And whoever is naturally short compute,
Speaker:you need compute, and for you it's cost management.
Speaker:So I want to make sure my product is usable by them. It
Speaker:depends on how they pay, right? If they pay in tokens,
Speaker:then it makes sense to create token products. If right now people are paying
Speaker:per GPU-hour, then you create products for that. If they pay
Speaker:some other way, then it's different contracts for that.
Speaker:So it really depends on how people are using it, today and tomorrow.
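The natural-long/natural-short pairing described here, a neocloud selling futures on a GPU-hour index to stabilize revenue while a compute buyer goes long to cap cost, can be illustrated with a one-period, cash-settled sketch. The index levels and one-for-one exposure are hypothetical simplifications.

```python
# One-period hedge sketch: a neocloud is naturally long compute (its
# revenue falls when GPU prices fall), so it sells index futures; the
# short leg pays (entry - settle) at settlement. All prices invented.

def hedged_revenue(unhedged_revenue_per_hr, futures_entry,
                   index_at_settle, contracts_short):
    # Cash settlement of the short futures position per hour hedged.
    payoff = contracts_short * (futures_entry - index_at_settle)
    return unhedged_revenue_per_hr + payoff

entry = 6.0                       # index level when the hedge is put on
for settle in (3.0, 6.0, 9.0):    # possible index levels at settlement
    # In this toy model, unhedged revenue tracks the index one-for-one.
    print(settle, hedged_revenue(settle, entry, settle, contracts_short=1.0))
```

Whatever the index does, the hedged revenue pins to the entry level: the gain on the futures leg offsets the drop in spot revenue, which is exactly the stabilization being described.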
Speaker:And then, you know, we hope to create products that may
Speaker:not be the S&P 500, which lives forever. We'll probably create financial
Speaker:products that live for the next five to 10 years. Because guess what? Chips
Speaker:go out of style, right? The A100, people are still using it, and
Speaker:the L4s, people are using them, but other chips, like the V100s,
Speaker:you know, probably not as much, right? Then, similarly, my
Speaker:financial products associated with that underlying asset
Speaker:probably will, you know, be retired, right? Which is fine.
Speaker:That's cool. I'm sorry, go ahead, Andy.
Speaker:I was just thinking about it and a couple of ideas popped into
Speaker:my head as you were describing that, Carmen. One is
Speaker:capacity. It sounds like you're literally selling
Speaker:compute capacity, GPU capacity, time, just
Speaker:whatever. But it kind of falls into that bucket, on the one hand.
Speaker:But then on the other hand, it seems like
Speaker:it almost creates this utility market.
Speaker:Is that fair, or am I missing something? Right. No,
Speaker:you're right. But there are two pieces. So one is the compute exchange part, right? This
Speaker:is where you can actually get, depending on
Speaker:people's preferences, GPUs or
Speaker:tokens, whatever, right? Physically delivered, you do you. You
Speaker:don't have to touch any financial products, right? It is literally like going to
Speaker:a store and buying stuff. And it's more option-based, right?
Speaker:You can actually get instances. And Silicon Data is: you
Speaker:cannot actually get any compute, right? Like you cannot
Speaker:get any stocks from Bloomberg. But you can get the data:
Speaker:what asset is trading, at what prices. So that informs your decision; ideally,
Speaker:in your spot market, you'd be like, hey, I think, you know,
Speaker:the H100 price is a little too high, in my opinion. I'm not going to,
Speaker:right, right now, like, forget about this. And I can totally use an
Speaker:A100, right? It's fine. So Silicon Data is the data
Speaker:layer, which is the liquidity data, right? So those are the
Speaker:sort of two pieces to, I guess, resolve the
Speaker:workflow equation. So it's kind of like when you go to the supermarket. I'm
Speaker:sorry, Andy. When you go to. That's okay, go ahead. When you go to the
Speaker:supermarket, you buy the beef, you buy the pork, but you don't think about the
Speaker:pork belly futures and stuff like that. It's kind of abstracted away from you.
Speaker:Exactly. The farmers will think about this, right? Yeah, farmers think about it.
Speaker:Yeah. They need to hedge with corn futures, right? But even if
Speaker:you are a farmer, when you're someone who wants to eat the
Speaker:corn, you go to the supermarket; you don't think about, hey.
Speaker:Right. So you may have covered this already, but how does
Speaker:or does fungibility come into play?
Speaker:It's a great question. So I went through so many different iterations on this.
Speaker:Initially I was like, okay, why don't I just normalize across FLOPs? And I was
Speaker:like, nope, can't do that, because there are
Speaker:just so many things wrong with that approach. We
Speaker:could dig into the details, but we're not going to do that. And then secondly,
Speaker:okay, why don't we do, like, inferencing
Speaker:chips? Just make a basket. And then we realized, okay,
Speaker:again, back to the initial question: I want to make a product that's actually going
Speaker:to help people hedge, right? If you
Speaker:do a combination of different chips, if you
Speaker:are a user, are you going to
Speaker:really use that to hedge? What would the correlation look like, right?
Speaker:Maybe you'd just rather have different chip types and then just hedge accordingly,
Speaker:because the correlation will be much higher than with a combination of indexes.
Speaker:Maybe the composition of indexes is good for just tracking in
Speaker:general, but not for actual financial products. So we can have both:
Speaker:some of them will be tradable, some of them, well, right.
Speaker:For us, if we move to a world
Speaker:where it's not going to be an Nvidia-only kind of play,
Speaker:like with AMD, and it'll probably
Speaker:end eventually. Well, we'll see when, right? We'll
Speaker:see whether quantum happens first or everyone catches up first. I have no
Speaker:idea, right? So if it's a more vibrant
Speaker:ecosystem, right, then maybe we're thinking, hey, maybe we can do
Speaker:some of the chips, even from different firms, normalize them, and then we
Speaker:do something like inferencing chips, training chips. I don't
Speaker:know. So that's another thing. Or token indexes, right?
Speaker:Can we do open-source ones? Multimodality? Is
Speaker:multimodality going to be a thing in a few years? Is everything going to go back
Speaker:to one model only? Because right now we're with different models, but maybe it's an
Speaker:interim stage, right? I don't know. So it's one of the things we have to
Speaker:keep looking at and thinking about and just moving things
Speaker:forward. Yeah, I was thinking, too, about, you
Speaker:know, the amortization that people
Speaker:do in their heads, at least, when they buy a new car. Yeah. So
Speaker:the math is: you drive it off the lot, it's worth what, 75,
Speaker:80% of what you paid for it.
Speaker:So we need a Carfax for GPUs, right? So that's what we do, too,
Speaker:with SiliconMark. So what we do is, okay, everything, well,
Speaker:at least right now, or before last year, or T minus 1, everything
Speaker:is brand new. So okay, we'll take whatever the
Speaker:numbers they published, the TDPs, the FLOPs; we all know
Speaker:there's a haircut to those numbers.
Speaker:That's funny, right? And then a year later, right, a year later I
Speaker:say, Andy, you're running great data centers. Your
Speaker:thermal cooling was doing great. I'm in an old data
Speaker:center, I don't have the latest cooling. Obviously my chip,
Speaker:after a year, you can argue they're on different curves,
Speaker:decay curves. Are we treating them as the same price even
Speaker:though they're the same configuration? Probably we shouldn't. Should it be a reflection of
Speaker:the actual quality? So that's something SiliconMark does.
Speaker:And then we do things even more basic than that. So number one is,
Speaker:when you tell me you have H100s, like 100 nodes, each node has,
Speaker:say, 8 GPUs, right? Yeah. Is that true? Can I,
Speaker:number one, verify the UIDs of those? And all the CPUs
Speaker:and the operating systems on all
Speaker:the nodes, they're all live-connected. Number one, can we just
Speaker:verify they are connected? What's the latency? So those are very
Speaker:basic things, right? So we do that piece. At least, you know,
Speaker:are the UIDs and CPUs real? Has the machine ever
Speaker:changed? Because we do hashed IDs; based on
Speaker:CPU changes, we know something changed, right? And then the UID of every
Speaker:chip. So we do the decay curve for the individual chips and also at the machine
Speaker:level, and then thermal degradation, everything. So we do
Speaker:that, and then we do validation, almost like Bloomberg validates fixed
Speaker:income. Because you have to understand the issuers, whether it's
Speaker:a bridge or a school, with the cash flows and all that stuff. So
Speaker:we do that for GPUs. The geolocation: if you build a data
Speaker:center somewhere in North Korea,
Speaker:it's great, but no one is going to use it, right?
Speaker:We took all of that into consideration when we created those data models. So then
Speaker:we figured out, okay, based on the setup,
Speaker:we run a benchmark on specific GPUs, this is our grade, and
Speaker:this is our validation. Obviously you can do whatever you want. You can
Speaker:say, hey, screw that, I believe the price is much higher. You can do that
Speaker:as well, right? But this is our valuation. So it's almost like a scoring system.
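The two ideas in this answer, verifying that the claimed hardware inventory matches what a probe actually sees, and discounting rated throughput by an age/thermal decay curve, can be sketched together. The decay rate, cooling penalty, grade threshold, and rated TFLOPS figure below are all invented for illustration, not SiliconMark's actual grading rules.

```python
# Toy version of the grading pipeline described above: check the reported
# GPU inventory against probed UIDs, then apply an age/thermal haircut to
# the rated throughput before grading. All constants are illustrative.

def effective_throughput(rated_tflops, age_years, annual_decay=0.06,
                         good_cooling=True):
    """Rated FLOPs discounted by an assumed exponential decay curve."""
    decayed = rated_tflops * (1.0 - annual_decay) ** age_years
    if not good_cooling:
        decayed *= 0.95            # extra haircut for poor thermals
    return decayed

def grade_node(reported_gpu_uids, probed_gpu_uids, tflops_effective):
    # Fail verification outright if the probed UIDs don't match the claim.
    if set(reported_gpu_uids) != set(probed_gpu_uids):
        return "unverified"
    return "A" if tflops_effective >= 900 else "B"

uids = ["gpu-%02d" % i for i in range(8)]   # 8 GPUs per node, as discussed
tput = effective_throughput(989.0, age_years=1, good_cooling=False)
print(grade_node(uids, uids, tput))
```

A node whose probed UIDs differ from the claimed inventory never gets a quality grade at all, which mirrors the "is that even true?" verification step before any pricing question.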
Speaker:That's interesting. When we started talking about cars, my
Speaker:mind immediately went to, you know, the used GPU lot:
Speaker:some guy in bib overalls out here in Farmville, Virginia,
Speaker:kicking the tires. What's it going to take to get you into this
Speaker:GPU?
Speaker:Yep. See, there we go. And network them together, right? Like, I think there's also,
Speaker:you know, maybe, you know, I don't know if
Speaker:you've been tracking the DGX Spark device
Speaker:that Nvidia has, but apparently they have ports
Speaker:in them so you can network, I think, up to four together. I'm not sure.
Speaker:But yeah, I'm sorry I
Speaker:cut you off, but like. No, no, no. Nvidia, we're
Speaker:definitely leveraging a lot of that. So we do the container within a
Speaker:container, and we do integrate with Nvidia DGX
Speaker:benchmarking. So they have open-sourced some of their LLM
Speaker:benchmarking based on GPUs, and we do streamline their products so
Speaker:you can test LLMs. So, Nvidia DGX testing
Speaker:through Silicon Data. The benefit is, if you do it all yourself,
Speaker:number one,
Speaker:obviously, people can just change the
Speaker:benchmark results themselves, right? It's open source. But through us, we're the data
Speaker:oracle; you can't really change the results. Number two, it's more streamlined. It takes a
Speaker:few hours to run versus taking weeks, because you'd download a bunch of things you may
Speaker:or may not need.
Speaker:Well, I also think, too, like, you know, how does this, you know, you
Speaker:kind of skirted around the location thing with sovereign
Speaker:AI, right? So, like, if I'm okay with using Google
Speaker:services, right, I have access to TPUs, right? I have a lot
Speaker:more access to whatever Amazon's chip is. Microsoft, I think, is
Speaker:working on something custom. That bears on prices too, right? With
Speaker:geolocation, they have different prices and different carbon footprints. We haven't even
Speaker:touched that. Right, right, right. We do track that as well, based
Speaker:on local power grid information. We do track the carbon cost associated with
Speaker:different AI workflows. I think it's important. So
Speaker:for me, it's: let me at least surface the number to you, and
Speaker:you decide what to do with it, right? So I think that's a good idea.
Speaker:Or, you know, maybe it turns out that, you know,
Speaker:this type or model of GPU is, you know, depending on what your
Speaker:core... I think it's great, because I think one of the things
Speaker:that I've heard, and I think it was Peter
Speaker:Drucker: what gets measured gets managed, right? So what you're doing is you're
Speaker:providing ways to measure GPUs and GPU performance, right?
Speaker:One of the things I heard about, and I'm sure
Speaker:you have some thoughts on this is like cloud providers that are
Speaker:starting up and they're just doing
Speaker:GPUs, right. They're just doing kind of training loads. Right.
Speaker:And they don't need to be located anywhere special. Right. Like they don't
Speaker:need to be in the northeast corridor. They could be in the middle of
Speaker:nowhere as long as they have power. Right. And
Speaker:because you're going to run a load, right, you're going to run a load on
Speaker:the thing, it's going to take 72 hours say to run. You don't really care
Speaker:if the latency is, you know, 150 milliseconds versus
Speaker:3. Right. It doesn't really matter. Yes.
Speaker:That's why you see a lot of these get built in, like, Iceland, Finland;
Speaker:the users can be in the Americas, can be in Asia. Right,
Speaker:right. For them, it's: can they get the capacity they're
Speaker:looking for? And you get a good deal with geothermal-powered
Speaker:data centers, cheap electricity. Yeah.
Speaker:And then it's cleaner supposedly. Right. As
Speaker:long as you're not on the volcano belt.
Speaker:Right. As long as it's not going to blow up. Yeah.
Speaker:But yeah, so we definitely see that trend. And a lot of energy, you
Speaker:know, what do we call it, oversupply. Sometimes it can
Speaker:be in Spain, because they overbuilt and the grid couldn't handle it. And
Speaker:then they need to get data centers up and running, like, now, to take up
Speaker:the power. But then
Speaker:it takes a lot to get the racks running. Right.
Speaker:More than just the GPU itself, you need the connectivity and networking,
Speaker:and that could be in shortage. So you need to solve a lot of different
Speaker:pieces to actually deliver the actual compute.
Speaker:But that's why it's a fascinating industry for us, because
Speaker:we see things all the way from the TSMC side.
Speaker:So any supply-demand shift will have
Speaker:an impact on the whole ecosystem. And this industry is winner-takes-all,
Speaker:from TSMC down to the solution level.
Speaker:You have to be the solution; your alternative solution is just not
Speaker:going to work. So every single piece is so critical to
Speaker:the whole chain. Packaging, right? It has to work,
Speaker:right? If you don't know how to do it, then you just can't do it.
Speaker:It's not like you can buy a cheaper pair of socks or whatever.
Speaker:So we see things end to end, right, from the semiconductor production side,
Speaker:TSMC. We're official TSMC partners, and we're actually going to be at
Speaker:TSMC conferences this
Speaker:November. Very cool. It is really cool. I
Speaker:get geeked out by that stuff very quickly. And all the way to
Speaker:the model, the token layer, right, the agentic layer. So
Speaker:we sort of see things all the way. Which,
Speaker:I think my brain gets overclocked every single day.
Speaker:I know what you mean, because I get to, like,
Speaker:2:33 PM and I'm like, I can't take any more input. Like,
Speaker:the muscle, my brain muscle is just dead. I know. How
Speaker:do you do that? How do I get a roller for my brain, just like,
Speaker:relax my brain muscles? I found going for a walk
Speaker:is a good way to do it. Right.
Speaker:No, like, a coworker of mine says everything turns to
Speaker:hieroglyphics when he's
Speaker:looking at stuff. And I was like, yeah, that's a good way to put
Speaker:it. Because it's just kind of like, yeah, I can't.
Speaker:I usually spend time with my daughters. I feel like
Speaker:they're being silly. And I would tell them, I'm so stressed out, and my daughter
Speaker:was like, me too. I was like, what are you stressed about? One less donut
Speaker:than the other kid. I was like, that's a very important thing. I agree with that.
Speaker:That's very stressful. I would be really upset if I got one less
Speaker:donut. So, yeah, it definitely puts things in
Speaker:perspective. Yeah, that's cool.
Speaker:I think one of the best things. Any other questions? No, plenty,
Speaker:plenty. Like, I'm just fascinated by this. I know
Speaker:we're kind of short on time, but one of the things that you mentioned was
Speaker:TCMC. TSMC.
Speaker:So for those who don't know who they are and how important they are to
Speaker:the global economy, could you explain for those folks
Speaker:why I was so excited that you're going to one of their conferences? I
Speaker:didn't know they had conferences, so. I don't think I would do the justice
Speaker:of explaining how important TSMC is. All right, how about I explain it and
Speaker:then you tell me where I'm wrong. I'm sure you'll do a better job
Speaker:than I can. So, TSMC: Taiwan
Speaker:Semiconductor Manufacturing Company. That's right.
Speaker:They are based in Taiwan. And
Speaker:the reason why... Nvidia. There's a fascinating
Speaker:story in the book called The Nvidia Way. I don't know if you've listened to
Speaker:that or read that book. Really awesome book. But basically,
Speaker:one of the advantages Nvidia had early on, and arguably
Speaker:now, was that they outsourced their chip
Speaker:manufacturing to this company, TSMC. I'll get it right that
Speaker:time. They are basically what they call a fab.
Speaker:And, I mean, not
Speaker:now, they're so busy, but you in general, right,
Speaker:like, I couldn't call them up and be like, hey, I have some prints for
Speaker:you, I have some chip designs I want you to make for me, can you
Speaker:send me... They're not at that scale. But
Speaker:they're a fab. And so what happens is people like Nvidia, companies like
Speaker:Nvidia, a few other companies too, will go and they will
Speaker:design their chips and then they'll basically,
Speaker:not drop-ship, but effectively print-to-order
Speaker:chips. Which frees up a company like Nvidia
Speaker:from having to build their own fabs, kind of like Intel does. Is that a
Speaker:good description? 100%. So I usually call
Speaker:Nvidia and AMD design houses, and then sometimes
Speaker:that confuses people, who are like, oh, are they like Louis Vuitton? I was like, no.
Speaker:Right, right. Or like graphic designers? Yeah, yeah. So they're design
Speaker:houses, and they are fabless. Right. And Intel,
Speaker:which is interesting, because they do both. Right? Yeah, yeah.
Speaker:Intel, as I was saying, yeah, they do both. Yeah.
Speaker:Right. And then, it could be a great strategy. Could work,
Speaker:or... well, it depends on many things. Right. Then anyways,
Speaker:so TSMC is, like, as I said before, this
Speaker:industry, I don't know if it's good or bad, but it's a winner-takes-all
Speaker:market. Right. So TSMC is definitely
Speaker:the winner, for a lot of different
Speaker:reasons. I think for the leadership itself
Speaker:and the technical team, for the whole supply chain ecosystem, the
Speaker:gravity, all the years, the hard work they've put in.
Speaker:So they're in a position where I don't think anyone
Speaker:can seriously challenge them
Speaker:in a meaningful way in the next however many
Speaker:years. So they're very critical. And then the
Speaker:interesting thing about them: they're agnostic to the design houses,
Speaker:right? So they have a great relationship with Nvidia for sure, and I'm sure
Speaker:with everybody, right. It's their job to
Speaker:produce those chips. And then,
Speaker:interestingly enough, it's aligned with mine, Silicon Data, because
Speaker:I'm agnostic to chips, right? So
Speaker:obviously I want to create products that are most important to the
Speaker:ecosystem. So right now people care about a few chips, and
Speaker:those chips happen to be from one design house. But let's say
Speaker:another design house starts picking up a lot of momentum. For me, it's
Speaker:like, how can I help everybody in the ecosystem
Speaker:compare, contrast, right? Benchmark them, normalize
Speaker:it in a meaningful way. So it's my job to work with all the design
Speaker:houses; it's their job to produce chips that can be usable for
Speaker:different design houses too. So we're very aligned in that sense. And
Speaker:anything they do, right? So think about it: they are
Speaker:future-looking, because they're not thinking about next year or next quarter. They think
Speaker:about 20 years, 10 years. It takes them five, six
Speaker:years to build a fab, right? And then they need that fab to
Speaker:be utilized. And they have a threshold, right? If you're
Speaker:building a fab and it's not utilized by year eight, the one
Speaker:they plan right now, by year 10 they are
Speaker:losing a lot of money. A lot, like billions of dollars,
Speaker:right? So can you make sure the fab will be utilized, the demand
Speaker:will be there, by year 10, forecasting from today?
Speaker:It's a very, very, very hard job to do. And it's not
Speaker:like, you know,
Speaker:rare earths and mining and all the things that you can hedge, right?
Speaker:There's a way to hedge the futures curve. But it's not like they
Speaker:can forecast and do a swap on that, because
Speaker:the market is so concentrated, and then very
Speaker:binary, and a huge size. Who's taking the other side?
Speaker:I don't know. It's very hard, it's too concentrated, to
Speaker:do. So for them it's to get clarity on the supply-demand curve in 10
Speaker:years. I mean, they also do edge computing chips as well, not just data
Speaker:center chips, right? So how do they think through that? I think that's
Speaker:really challenging. I think it would be really challenging for me,
Speaker:for sure. I'm sure they have way smarter people there to think through those problems.
Speaker:But yeah, it's an interesting problem to have.
Speaker:That's why, TSMC for example, they sell to
Speaker:their clients, who are the Nvidias of the world. So they have that kind
Speaker:of transparency. But what they don't have, which
Speaker:may be a different indicator for the supply-demand curve in
Speaker:10 years, is end-user
Speaker:pricing volatility. Right? And then, you know, okay, so if
Speaker:every single chip I produce, right, data-center-
Speaker:quality chips, one die's price, right,
Speaker:is the indicator for supply-demand shifting. Maybe it
Speaker:is, maybe it's not, right? At least you have some data points, which your
Speaker:immediate sales and revenues, which is T0,
Speaker:won't give you, because you're a few degrees removed from
Speaker:the end-user experience. You give it to Nvidia, and Nvidia packages it to
Speaker:AWS and GCP and end users, and you and me. Right.
Speaker:So that's something for them to think through as well.
Speaker:Interesting. One of the stories I heard, and I
Speaker:wonder if it's true, was that part of the
Speaker:reason, there were a lot of reasons, but part of the reason
Speaker:why Nvidia was able to really capitalize on this
Speaker:was the fact that in the
Speaker:crypto craze, the run-up to get chips for that, Nvidia
Speaker:had purchased... and what you said makes a lot more sense now. Nvidia had
Speaker:basically purchased a certain amount of capacity at TSMC
Speaker:for, like, three to four years, something like that. And then that happened to
Speaker:coincide with the AI boom. Is that true? And
Speaker:I guess that's a market too, right? Like, you know, like, hey...
Speaker:So I wasn't... I'm not following all the ASICs,
Speaker:so they have specific chips for the mining.
Speaker:That could be true. So I think
Speaker:I mean, and a girl can dream, I
Speaker:strive, you know, to really
Speaker:help the industry, and then, you know,
Speaker:hopefully the company, the team, can propel the industry
Speaker:forward. Right. I strive for that.
Speaker:And competency is very important, obviously execution, your
Speaker:hard work is important. But a big piece is you have to be
Speaker:really, really lucky. That is outside everyone's control.
Speaker:And then Nvidia puts so much time and effort into everything they do. You can argue
Speaker:they were a really great company even before the AI boom and
Speaker:everything. But the luck piece, how do you control that? How do
Speaker:you know crypto was going to be the piece
Speaker:that's needed? Right. Well,
Speaker:Someone said that, you know, Jensen Huang is the epitome of,
Speaker:you know, the harder you work, the luckier you get.
Speaker:True. Like, there's a lot to that, and I know it's
Speaker:complicated, but it's interesting how the crypto
Speaker:boom and bust really kind of also
Speaker:propelled us into the AI boom. Not all by
Speaker:itself, but it definitely, I think, gave... there was some momentum where
Speaker:no momentum was expected, if that makes sense. Right. Yeah, I agree,
Speaker:I agree. Timing is so interesting. But
Speaker:at the end of the day, you have to
Speaker:do everything you can with the environment you're in. Right? That's
Speaker:cool. That's cool. It's all data. So we'll see what happens.
Speaker:I mean, that's the importance of data, right? Like, you know, people don't realize that.
Speaker:And calling back to Bloomberg: I'm referring to Michael Bloomberg,
Speaker:former mayor of New York. But before he was mayor, he
Speaker:basically started a company called Bloomberg. And
Speaker:he was not the only factor, but
Speaker:a big part of, you know, his
Speaker:philosophy, as I understood it, and if there's a good biography
Speaker:on him, I totally would want to listen to it, but basically, getting
Speaker:traders access to data gave them an advantage. Right. And he was
Speaker:really early on in the idea that data is
Speaker:not just something that's created as a byproduct of
Speaker:transactions, but can actually be, you know, monetized,
Speaker:and arguably weaponized. Right. Like, so.
Speaker:And, you know, Bloomberg terminals:
Speaker:it was interesting because he basically sold these custom terminals, so you'd
Speaker:not have to rely on, like, local IT, who were still struggling with, you
Speaker:know, just keeping the network up and running. These separate
Speaker:devices became status symbols. And ultimately that's become
Speaker:this media empire where, you know, I can watch Bloomberg on my
Speaker:TV, I can listen to it, whether it's satellite radio or the
Speaker:app or, you know, FM or AM radio
Speaker:stations. I think it's in San Francisco, New York, and
Speaker:D.C. They have a big office in D.C. They have an
Speaker:interesting show called Political Capital. I think it plays
Speaker:at 5 PM every day. I listen to it because it's kind of the
Speaker:policy side of finance, and kind of what's going on in the world.
Speaker:And AI has come up a lot, digital sovereignty. So it's interesting
Speaker:how all of these worlds, and I'd like your thoughts on this,
Speaker:right, the worlds of finance, the worlds of tech, and the worlds of policy,
Speaker:politics, and dare I say war, right, they're all kind of
Speaker:crashing together in this giant thing. And
Speaker:it's kind of cool, kind of scary.
Speaker:I think it can be. I mean, sometimes I'm scared, like,
Speaker:you know, because you see a few things and it's like, whoa.
Speaker:There's a lot. I feel like for people born post-COVID,
Speaker:not born, but who grew up post-COVID, I would call them the second
Speaker:Gen Z, Gen Alpha. Yes. I think it's Gen
Speaker:Alpha apparently now, like, I'm all confused. But for
Speaker:them it's like, of course. Of course my AI should be my
Speaker:boyfriend, girlfriend, right, like, whatever. And then for me it's like,
Speaker:this is not comfortable at all. Weird.
Speaker:Yeah, yeah, yeah. For me it's, I have no idea what's going on.
Speaker:Like, I'm just so creeped out by this. But for a lot of people it's like,
Speaker:of course you do that. Of course you tell AI all your secrets.
Speaker:Of course my phone can record my conversations. Of course
Speaker:you can train, you know, your AI model, my
Speaker:model, using all my Gmail content and information.
Speaker:It's all edge computing, I have my own AI model. Of course you can wear,
Speaker:you know, glasses and then record everything you and I talk
Speaker:about. And how secure is everything
Speaker:right now? Right.
Speaker:Hardware-level encryption
Speaker:is only available on a very specific few chips.
Speaker:TPUs can do that. Otherwise you rely on software encryption.
Speaker:No, it's true. And software encryption is vulnerable to a quantum
Speaker:attack, which is not that far away. The
Speaker:software and use cases are moving so quickly, the hardware hasn't been able to catch
Speaker:up. And it's expensive to do hardware encryption. It takes
Speaker:longer and it's more expensive. That's why sometimes the hyperscalers charge a
Speaker:higher premium for that reason. Right. Are you willing to spend the
Speaker:tokens and time and effort to do it? For some use cases, you can argue,
Speaker:yes, yes, absolutely. No edge computing
Speaker:chips can do that kind of hardware-level encryption.
Speaker:And it's happening, like, now. Right, right.
Speaker:I was talking to a startup called Quantum Knight. They claimed to have a solution
Speaker:that is a low-compute kind of post-
Speaker:quantum-ready thing. So I can send you their
Speaker:link and information. Yeah, we track quantum
Speaker:computing prices as well. Very different than GPU pricing, like, you know,
Speaker:per-shot, per-second, per-minute pricing versus hourly. Right. These are
Speaker:different cycles you run. And then the GPU becomes like the error-correction
Speaker:component of the whole thing. But for us it's like, okay, so
Speaker:compute is compute now, GPU and TPU, whatever-PU, and then
Speaker:it becomes quantum. How do we think through that? I don't
Speaker:know. My brain just, like, you know. Yeah, I know. At some point it just
Speaker:becomes like... I'm not smart enough
Speaker:right now to figure that out. I'll tell you, like, I go
Speaker:through, like, quantum stuff, and I always joke with Andy, like, I'll be, like,
Speaker:15 minutes in and I get a migraine, which is basically, like, my brain's version of
Speaker:blue-screening. And, like, just like, okay, I can stop. I can get
Speaker:to about 45 minutes now, which is, you know, an
Speaker:improvement. But there is actually a good book,
Speaker:and the author was actually a guest recently on the quantum computing podcast.
Speaker:It's a thick book. It's a thick book. But I'll tell you this:
Speaker:the first three chapters, introducing the concepts,
Speaker:are probably the single best introduction to the
Speaker:concepts I have ever read. Yeah, I will send you the link. Yeah,
Speaker:yeah. Dancing with Qubits.
Speaker:Really interesting book. Super nice author too. He's a trip.
Speaker:But, no,
Speaker:you're right. Like, the thing that really worries me is, I kind of
Speaker:think about it like we built our entire economy
Speaker:on a house of sand. Can we even start on this? That's
Speaker:another thing. We'll have to have you back on the show for
Speaker:a second one. But, like, other countries too,
Speaker:where they lay off hundreds of thousands of people, not just at American
Speaker:companies. Right?
Speaker:Yeah. Don't even get me started on that. Well, like, and, like, you know, we're
Speaker:all based on... and the other thing, the elephant in the room, right, is
Speaker:the fact that the T in
Speaker:TSMC stands for Taiwan. Right. Kind of...
Speaker:I know, I know it's very dangerous to talk about this, but, like, it's
Speaker:kind of like, shoot. So I won't say much, but I'll just say it's
Speaker:contested real estate. How about that? Right. That's a pretty safe way to say it.
Speaker:Right? It's contested. Right. And, you know,
Speaker:the entire world, the kind of modern
Speaker:civilization, effectively revolves around the manufacturing that happens there. And
Speaker:God forbid, like, you know, whether it's man-made or a tsunami or a bad
Speaker:earthquake, like, I mean, our world, we get sent back
Speaker:to the 1700s pretty quickly. You know, the 1700s is not...
Speaker:you know, there were still people, still human beings, in the
Speaker:1700s. It could be worse than that. That's true. It could be way worse than
Speaker:that. That is a good point. I was trying to keep it... I was trying
Speaker:to end it on a positive note. And I know, you're saying, like,
Speaker:no humans. Well, no, I mean, like,
Speaker:I mean, there are a lot of ways that this apocalypse could go,
Speaker:so to speak. Right. It could be, you know... but, like, it's a very... And,
Speaker:like, just from an infrastructure point of view and a supply chain point of view, like,
Speaker:you know, we've really championed
Speaker:globalism and kind of all of these extended supply
Speaker:chains. You know, there were reasons, there's always reasons, but, like,
Speaker:at the cost of resilience. Right, right. That's kind of scary.
Speaker:I assume you've read Taleb, right? Like, Antifragile.
Speaker:I'm so sorry. No, that's fine. That's fine. But I really appreciate you taking the
Speaker:time. Where can folks find out more about you? SiliconData.com.
Speaker:SiliconData.com. Awesome. And we'd love to have you back on the show,
Speaker:and you can tell us what these conferences were like. The TS...
Speaker:Let's see how much I can understand
Speaker:first. Right, right, right, right, right. That wasn't a good question.
Speaker:That's why you've got to be like the kids today and record all your conversations
Speaker:so you can talk to the transcript later. All right,
Speaker:nice seeing you guys. All right, thank you. And we'll let our AI finish the
Speaker:show. And that wraps up another episode of Data Driven, the podcast
Speaker:where we ponder the future of AI, data, and occasionally
Speaker:the fate of humanity if we don't get GPU pricing under control.
Speaker:Big thanks to Carmen Lee for joining us and blowing our minds with
Speaker:compute market mechanics, financial innovation, and just a
Speaker:touch of economic existentialism. Be sure to check out
Speaker:silicondata.com to learn more. Just don't try to day trade
Speaker:H100s after midnight. If you liked what you heard,
Speaker:subscribe, leave a review, or send us compute credits.
Speaker:Until next time, stay curious, stay caffeinated,
Speaker:and remember, in a world of exponential AI, transparency
Speaker:might just be the killer app.