Speaker:

In today's episode, Frank and Andy sit down with special guest

Speaker:

Max Sklar to delve into the world of artificial intelligence, data

Speaker:

science, and data engineering. Max, a

Speaker:

trailblazer in location data and machine learning, shares

Speaker:

insights from his extensive experience at Foursquare, including his

Speaker:

work on local search and bias correction. Get ready

Speaker:

for a thought provoking discussion about groundbreaking projects, tech

Speaker:

drama, and the ever evolving landscape of technology.

Speaker:

So sit back, relax, and prepare to be amazed.

Speaker:

Hello, and welcome to Data Driven, the Podcast, we explore the

Speaker:

emergent fields of artificial intelligence, data science, data engineering,

Speaker:

and all that good stuff. With me on this ever present,

Speaker:

journey down the information superhighway is Andy Leonard. How's it going, Andy?

Speaker:

Good, Frank. How are you doing? I'm doing alright. It's been a wild 24

Speaker:

hours in, Maison Levin, or or

Speaker:

Maison, Lavinia, depending on your, how you wanna pronounce it.

Speaker:

At one point, we will share those crazy

Speaker:

details, but it's been good most of the part. I I am,

Speaker:

recovering from a, a bout of COVID,

Speaker:

that hit the entire house. I think we all picked it up on the cruise.

Speaker:

Again, there are worse places to pick up COVID, and there's worse

Speaker:

things that could happen. I've I've sneezed quite a

Speaker:

bit, I've coughed quite a bit, but The thing that's bugging me the most is

Speaker:

this headache I had now for 48 straight hours.

Speaker:

But it's okay, like I'm kind of living, learning to live with it, and I've

Speaker:

actually given it a name, I call it Charlie. So, you

Speaker:

know, Charlie is is is gonna be on the show today as

Speaker:

well. I see. Yeah. You do sound a little different, but not much.

Speaker:

Nah. That's why I need to clone my voice, and, maybe how they

Speaker:

do the avatar stays. As I was saying in the virtual dream room, this is

Speaker:

the 1st time I've I've felt fit for camera In about

Speaker:

a week. Perhaps you can use some of that good

Speaker:

AI voice modulation. There you go. We are

Speaker:

pleased to present, with us, today is Max Sklar.

Speaker:

Max is a, not only a fellow, data

Speaker:

guy, but also a Fellow podcaster.

Speaker:

Welcome to the show, Max. Thank you so much, Frank and Andy,

Speaker:

for, for having me on. I'm looking forward to it. And,

Speaker:

yeah. Thank you. Cool. So you you've been at you were at a

Speaker:

company for a number of years. You were talking about this in the virtual dream

Speaker:

room. You were at Foursquare for, oh, quite some time. Right.

Speaker:

That's unusual, actually. 10 years at a at a company as as, you

Speaker:

know, building software is pretty unusual. But I was really there For, I

Speaker:

think, like, 3 different phases, so I kind of break it up into 3

Speaker:

jobs. Interesting. So, you know,

Speaker:

Foursquare is one of those companies that

Speaker:

On when they broke up into 2 different parts, it kind of I didn't

Speaker:

understand it. Like, this is just user. I'm not I'm not I know 1 I

Speaker:

know that sounds terrible, but despite being my headache, Charlie talking, but, like,

Speaker:

what, you know, obviously, that was a decision

Speaker:

that was made in the marketing department, and I don't think people probably thought that

Speaker:

through. But I love that location. Like, I I think, you know, like, I was

Speaker:

the mayor of, like, 5 6 different places. It was such a cool

Speaker:

creative, concept that if you go there long

Speaker:

enough, you're the mayor, and some places would offer the mayor, like, a free coffee

Speaker:

and a donut or something like that. Like, it was really clever.

Speaker:

I know. I used to I mean, that was one of the things that got

Speaker:

me really excited about it back in the day. And this is already 10 years

Speaker:

ago. I assume you're talking about the, The app split back in

Speaker:

2014, which I guess is That sounds about right. I I guess

Speaker:

almost 10 years. I guess it's It it's actually been 10 years since we started

Speaker:

working on that, which was also how it worked. But,

Speaker:

no. I I mean, I actually remember a lot of what was going on

Speaker:

at the time, and, we could talk about that a little more in a

Speaker:

second. I I I I I'm happy to talk about that.

Speaker:

I I think the the whole gamification thing was really

Speaker:

exciting. And I really loved the idea that I can go into

Speaker:

a place and be like, hey. I was here for 5 times. Oh, we're gonna

Speaker:

give you some free, you know, free, you know, dessert or

Speaker:

free chicken nuggets or whatever. Maybe these days, you know, kind of trying to watch

Speaker:

what I eat a little bit more, it might not be as exciting. I I

Speaker:

it was really sad that that didn't scale as a business. I think what

Speaker:

would happen was, and and this I both experienced

Speaker:

this personally. And also from talking to the leadership there, it was clear

Speaker:

that they felt it it, You know, it it it wasn't a

Speaker:

sustainable thing because you'd often go into a place and say, hey, you know,

Speaker:

I checked in here all these times. I got all these rewards. You know,

Speaker:

I'm supposed to get, like, a dollar off, or I'm supposed to get some

Speaker:

some free dessert or something, which is It's always exciting

Speaker:

to get a little reward, even if it's not, you know, even if it's just

Speaker:

a dollar or whatever. It's always but I think what would end up happening is

Speaker:

like the the man, the The management who worked there or the or the

Speaker:

bartender or whatever would know about it. And, you know,

Speaker:

once a few times that, you know, you got that thing. Oh, oh, let me

Speaker:

go in the back and check. And then they come back in, like, 20 minutes,

Speaker:

and it's like a whole big deal. They're like, maybe I don't wanna do this

Speaker:

again. That's interesting, because, like, from my point of view,

Speaker:

it was always very It was very fun, like, so I was mayor of,

Speaker:

there used to be a ferry service between, in the

Speaker:

suburbs of DC, between Poolesville, Maryland and

Speaker:

Leesburg, Virginia. And, I would take that ferry a lot so

Speaker:

much. So I was actually the mayor of the ferry. It didn't get

Speaker:

me anything, like, in that case, but it was this kind of cool, like I

Speaker:

I love to I I always imagined, like, there would be, You know, like

Speaker:

the ferry could have some kind of a a little like TV screen that could

Speaker:

like show who the mayor was. And then you can, you know, and and and

Speaker:

then then then we can like kind of scale up those fights, But alas,

Speaker:

society didn't go in that direction. Interestingly,

Speaker:

Foursquare, what, like, I have been interested in that this like local

Speaker:

local search space since since well before forest grabbing even my, you

Speaker:

know, my my senior project as an undergrad, back,

Speaker:

though, starting in 2005, was this, like, you know, this this website

Speaker:

called sticky map, where people would post little, Icons all

Speaker:

over Google Map. It's kind of inspired by Wikipedia. You can add messages.

Speaker:

And I just thought it was pretty cool. People started marking up the, the campus,

Speaker:

and then people started, you know, marking up. It was like, well, it's based on

Speaker:

Google Maps API, so you could just mark up anything you want. And

Speaker:

I think in that first project, I noticed All the problems

Speaker:

that still exist with that kind of data today. Okay. What happens when you

Speaker:

add duplicates? It's the first thing that happened. As, you know, As an undergrad, I

Speaker:

was like, oh, I'm so excited. Let me show something like this. They're like, okay,

Speaker:

let me create a marker. And I'm like, don't create that one. It's already been

Speaker:

created. And then it's like, okay, now we have duplicates Right off the bat. And

Speaker:

that is still something that, you know, Foursquare

Speaker:

deals with and I'm sure Google deals with and Apple Maps deals with and then

Speaker:

they they all deal with it. So,

Speaker:

Yeah. It it it it I I think when I

Speaker:

discovered Foursquare, you know, several years later,

Speaker:

it was the it was innovative in several ways. It was, first of

Speaker:

all, it was based on on mobile apps actually being at the place when you're

Speaker:

commenting on it, which is exciting. It was the gamification of it. And the

Speaker:

fact And and for those listening Yeah. For those listening, that was new. Like,

Speaker:

that was brand new. So sorry. I didn't mean to cut you off, but, like,

Speaker:

the context is important because All I had was was something on a on

Speaker:

a website. You know, I I wasn't thinking of the I the iPhone didn't

Speaker:

exist, for actually, pre Foursquare,

Speaker:

there there was something called dodgeball, which was kind of the the

Speaker:

predecessor to Foursquare, Which I wasn't involved with, but it but, it

Speaker:

it was based on, like, you know, SMS kind of text messages where people were

Speaker:

messaging on. If you remember, You know, those, you know,

Speaker:

the the the what was it? The t nine texting where people were Oh,

Speaker:

God. Yeah. Yeah. Yeah. But people would use that and and Foursquare and,

Speaker:

Google bought that. And then the the the founders,

Speaker:

Dennis Nautz went it went in and started Foursquare after that.

Speaker:

So, very, there there's a very

Speaker:

interesting history of kind of, like, Local search, city guide,

Speaker:

and basically sort of social kinda local applications. We're

Speaker:

we're very big at the time. Nowadays, I think we need to find a new

Speaker:

take on it, but, that when I joined Forescord

Speaker:

in 2011, it was very exciting. I'm glad you mentioned that because there was

Speaker:

a a real so I started my career in New York City. Right?

Speaker:

So I worked at Barnes and Noble com. So I was there in the fairly

Speaker:

early days of Silicon Alley, and that was a huge thing. It was

Speaker:

Microsoft. I think it was Microsoft had something called Sidewalk.

Speaker:

And then there was there was maybe it was maybe it wasn't Microsoft. Maybe

Speaker:

somebody else had something called sidewalk AOL had something, going and the fact

Speaker:

you can't remember it, I think says it all. Right? Like, it was like in

Speaker:

in, you know, anybody that could register a .com could spell and could

Speaker:

spell HTML To get funding back in those days, a

Speaker:

bit like the way the AI startup ecosystem is kinda

Speaker:

today. But, no, there was I mean,

Speaker:

people there was a time, young children

Speaker:

out there, That when, you know, people saw the online

Speaker:

world is slightly different than the real world, and they saw this as an opportunity.

Speaker:

Right? But no, I'd like That takes me back. As soon as

Speaker:

you said to, like, the local kind of connection guides, I was like, wow, it

Speaker:

takes me back. Yeah. Yeah. Coming back to

Speaker:

the app split, and I wasn't expecting to talk about this today. I'm sorry if

Speaker:

it brings up If for what it's worth, I'm a former Windows For what it's

Speaker:

worth, I'm a former Windows phone developer, and I wrote a book on silver light.

Speaker:

So I understand the pain of working on

Speaker:

It'll fade and I worked at barnes and noble.com. Right? So there's my trifecta of

Speaker:

ill fated technology projects. Yeah. I I think it's

Speaker:

A lot of technology companies, in order to become successful, actually have

Speaker:

to go through big changes where people yell at them.

Speaker:

And so it's like, how do you know whether you're breaking things or whether you're

Speaker:

actually doing what you're supposed to do? And so that's kind of a Right. That's

Speaker:

that's kind of a tough decision. I I think For Foursquare

Speaker:

at the time, there was always, like, kind of a design,

Speaker:

and product, like, tension between the people who

Speaker:

wanted to be there As essentially like a Yelp replacement, kind of

Speaker:

like a a local search city guide. And then there and then versus the people

Speaker:

who were there for the the life logging, the check ins, the game. And I

Speaker:

think, I think the separation could have been

Speaker:

done. I mean, my personal thing is I think there could have been a

Speaker:

Separation, I could think, could have been executed a little bit better. I

Speaker:

think technically we did a good job. I think the apps that

Speaker:

we ended up with were well designed, but I think, I think

Speaker:

we needed to do is take into account how the how people were using the

Speaker:

apps, at the time and not just, like, Kick all the

Speaker:

people who are checking in, which was which was Foursquare's kind of bread and butter

Speaker:

and just, like, kick them to the side with this other app. And then it's

Speaker:

like, well, what is this? I'm calling this something different. Sorta. That was that was,

Speaker:

I think, too much. But, again,

Speaker:

there there were people saying it at the time. But

Speaker:

The problem is, I I guess, you know, there whenever you make a

Speaker:

change, there's always a great many people saying a great many things. So

Speaker:

We could wax this we could wax nostalgic about Foursquare because I used to

Speaker:

like, when I I was travelling a lot at the time when I worked I

Speaker:

worked at Microsoft about 10 years ago. But I remember Sometimes I would actually

Speaker:

choose different connecting airports, so I could get, like, the the

Speaker:

jet that was it, the the jet set tag, like level up in my achievement

Speaker:

there. Right. Which is kinda sad, but, we could

Speaker:

we could wax nostalgic about that all day, and I would love to. But I

Speaker:

think what What was the role of AI and ML in that

Speaker:

space? Right? Because you're obviously collecting a lot of data. No. Like, I'm just curious,

Speaker:

like, because how how was that being used? How was that,

Speaker:

leverage. In in in a lot of ways,

Speaker:

and, you know, many of which I I worked on over all those years.

Speaker:

You know, one of them was, I mean, just, you know, search

Speaker:

ranking in general, which, you know, Foursquare had a lot of ex Google engineers.

Speaker:

So I learned directly from them so they they knew what to do.

Speaker:

But search ranking, search ranking was a was a was a big project.

Speaker:

This is kind of more of a statistical Problem where you were kinda trying

Speaker:

to weight different attributes, like, is this related to the search the

Speaker:

person put in? Is this related to how much do we wanna, You know,

Speaker:

score things that are, you know, maybe someone's

Speaker:

friends went to. So something like that. I think that

Speaker:

The biggest well, I'll talk about the one that I think is the biggest deal

Speaker:

and then the one that that I worked on the most. The one I think

Speaker:

was the biggest deal for for Foursquare, which I did work on a little

Speaker:

bit, is basically trying to figure out where

Speaker:

someone is given. So we know where someone is given there that long

Speaker:

from their phone. But it's like, what are they actually in a particular

Speaker:

store? Like, are they in the Starbucks? Are they in the, you know, are they

Speaker:

in the office, over there? Are they Are they just walking down

Speaker:

the street? And so using the fact that people were were

Speaker:

giving us training data, essentially, which was a big theme there, which is, you know,

Speaker:

I think, something that,

Speaker:

data scientists and data entrepreneurs need to need to look in closely, which is

Speaker:

like, How can you get people to give you training data? Because it is really

Speaker:

useful. So if you have people giving you where they are and

Speaker:

then you could see the information from their phone, not just a lot long, but

Speaker:

like what, You know, things like what Wi Fi's can you see? What, you know,

Speaker:

other sensors from your phone, can you figure out where they are? And then there's

Speaker:

the whole stop detection, problem. And so, Yep.

Speaker:

Foursquare essentially can kinda figure out, you know, where you went day

Speaker:

to day, and it's actually pretty good. Like, you know, if I Don't tell Foursquare

Speaker:

where I went. Even today, I still look at it and, you know, it tells

Speaker:

me what, what actual stores I was in. Now maybe there's a question of,

Speaker:

you know, whether whether our apps are knowing too much about us, but

Speaker:

that's that's a whole another question. But that was a very important,

Speaker:

a resource for the business. And the one that I worked on the most, that

Speaker:

was the most exciting though for me was the natural language

Speaker:

processing Pipeline. And, of course, you know, text text

Speaker:

data today is is is having such a a resurgence

Speaker:

with, You know, I don't need to tell your audience with AI and all that.

Speaker:

But, you know, it it it back then it was like, well, people were giving

Speaker:

us, you know, several sentences called tips on Foursquare Venues,

Speaker:

which would often be like, here's, you know, here's what you should do here. Here's

Speaker:

what you should try. Here's a little review, something like that. People were

Speaker:

leaving text with their check-in. So there's a bunch of texts, there's

Speaker:

menus, things like that. So there's a bunch of texts in the system. And so

Speaker:

it's like, what do we do with all of that? And, one of the things

Speaker:

that we did was we pulled out key terms, you know, noun phrase detection.

Speaker:

This is all kind of standard natural language

Speaker:

processing. You know, not

Speaker:

you know, you know, people often ask, oh, you know, I I think

Speaker:

Nowadays, I'm often thinking everyone's thinking, oh, you were probably using, like,

Speaker:

generative AI or something. No. It was just kind of standard NLP that had

Speaker:

been developed over the last, you know, several decades. But,

Speaker:

we did sentiment analysis and we used that to come up with the ratings for

Speaker:

the venues, which which are used today. So you could tell

Speaker:

how good something is. And,

Speaker:

you know, I did some things that were a little bit

Speaker:

more interesting that, you know, maybe get overlooked, but they're

Speaker:

they're kind of unique to To to what we did there, which was sort of

Speaker:

like timeliness and seasonality, which is so, like, if you check into

Speaker:

a diner in the morning versus in the afternoon, It'll statistically

Speaker:

give you different suggestions based on how timely it thinks each

Speaker:

each suggestion is. Because with every Check-in where someone is doing

Speaker:

something in real time. We have the timestamp. We know what time of day. We

Speaker:

know what time of week. We know what time of year. And so it's kind

Speaker:

of cool to to put that all together. And some of

Speaker:

the some of the, some of the

Speaker:

models, got pretty,

Speaker:

you know, it was it was pretty neat how it all turned out. I think

Speaker:

that one I you know, I still talk about that one is one of my

Speaker:

That's my favorite one after being in the industry so long, even though it was

Speaker:

like 10 years ago, because it was like, okay, we had training

Speaker:

data again from on these tips where, You know, we

Speaker:

could tell if the person liked the venue or disliked the venue and because

Speaker:

they they told us, and they also left the tip. There were a lot of

Speaker:

people who did that. So that just gave us training data for sentiment analysis.

Speaker:

And at the time, I'm sure the tools now are much more sophisticated at the

Speaker:

time when we use pretrained sentiment analysis tools, Didn't really

Speaker:

work well on our data because it's just it was just a different kind of

Speaker:

text. People wrote on Foursquare differently than they did on Twitter, for

Speaker:

example. So, so that gave us training data. Give

Speaker:

us training data for every language. And so that was nice. We got kind of,

Speaker:

like, you know, 90 languages for free just by just by

Speaker:

using that Strategy of Oh, wow. Using the data that people gave us.

Speaker:

Probably not probably didn't work very well in all 90, but certainly worked

Speaker:

well. Well, the beauty of it is It ends up working

Speaker:

well so long as we have good language detection, it ends up working well

Speaker:

in, any language that has any Particular,

Speaker:

you know, any particular popularity in Foursquare.

Speaker:

So for example, if, the Turkish was very popular. Okay. Well, that

Speaker:

means we have a lot of Turkish training data. That means that the

Speaker:

the model, which trains monthly, is Is going to use all that training data. That

Speaker:

means it's going to work very well. And so, and

Speaker:

so that the fact that the models were always regenerating

Speaker:

And they were always regenerating based on the latest data

Speaker:

was was really cool because oftentimes you think these think about

Speaker:

ML teams kind of building a model, and then they kind of throw

Speaker:

it over a wall. They they productionize it. And then you have to

Speaker:

work on the next one, but you have to you have to do some work.

Speaker:

It's not automated, you know. So it's like, well, this is this is gonna start

Speaker:

going downhill If we don't, if we don't interact. And the fact that we were

Speaker:

able to set it up where it was just constantly getting smarter was,

Speaker:

was pretty neat. So MLOps and pipelines before they were called

Speaker:

MLOps. Well, they might have been called pipelines, but yeah. Interesting. Yeah.

Speaker:

Pipeline was a big Big big key phrase. So what

Speaker:

what did the data what did the back it because like one of the one

Speaker:

of the jokes that we have, and in fact, it's a domain name that I

Speaker:

registered. 1st, you get the data, is a phrase that

Speaker:

a lot of data scientists will often use, much to the chagrin of a lot

Speaker:

of data engineers, because a lot of data,

Speaker:

you have to get the data in a certain way to to format it and

Speaker:

and and to get it trained. And if you go to first you get

Speaker:

the data.com, it should redirect you to our website,

Speaker:

hopefully. God only knows if it works. 1st, I think

Speaker:

yeah. I'm gonna I'm gonna try to get the data.com.

Speaker:

I'm shattered to think that okay. Good. It does work. Okay. DNS

Speaker:

and me have a long history. Yes. It's going back.

Speaker:

Good. I I it's always good to start off a week with a win with

Speaker:

DNS. What did it what did the

Speaker:

because I'm curious, like, Foursquare was one of those early, kind

Speaker:

of mobile first, kind of success stories. I'm

Speaker:

always curious, what did the back end data platform look like? Right?

Speaker:

Because, and again, going back 10 years, I

Speaker:

mean, I mean, did you use what was the name of the,

Speaker:

gosh. Can't think of the name of the platform, but what sorts of technologies did

Speaker:

you did you guys use? Yeah. I'm I mean, I'm sure

Speaker:

it works similar today in in at at Foursquare. Mhmm.

Speaker:

Well, we were using data pipe I assume. But, yeah,

Speaker:

if I remember correctly, we had, you know, our transactional database, our Mongo

Speaker:

database that was sort of like, Every once in a

Speaker:

while. And so that was kinda like the baseline. And then there'd be a series

Speaker:

of jobs that, like, you know, built it up, that that

Speaker:

that kind of Calculated things off of that, and that

Speaker:

would, in the at the end of that pipeline, you know, release

Speaker:

a, a dataset that would then be kind of,

Speaker:

Automatically, deployed and then read by,

Speaker:

read by the server in real time. So, if I can think of, like, the

Speaker:

technologies, I think the, the pipeline technology, the

Speaker:

pipeline, what was it? It was

Speaker:

like Luigi. It was written in pipe. Python. I don't know if that's too interesting.

Speaker:

There's a lot of different ones you could use these days.

Speaker:

It's an interesting question of, like, you know, which one do you use? I,

Speaker:

it's It's probably,

Speaker:

you know, from from my point of view, it's always like, well, the company kinda

Speaker:

chooses it. You don't really have much of a say.

Speaker:

And then then it's like, well, well, how do I know how to compare? But

Speaker:

let's see. Like, you know, we were using MapReduce jobs. We're using Hadoop At the

Speaker:

time, I think scalding, was was one that's that's maybe kind of out of fashion

Speaker:

now. That was a, a scala based framework for for

Speaker:

some of these jobs which were, which which

Speaker:

was based on abstract algebra. So it's actually pretty cool. I wish it

Speaker:

was. It was kinda hard to to reason about sometimes if you

Speaker:

It kind of went too far, to the side of, okay,

Speaker:

you know, I love abstract algebra, but I don't want everybody

Speaker:

who I don't I don't want that to be a barrier to entry for people

Speaker:

who are we're working on this. But, I'm

Speaker:

just trying to remember, like, some of the, You know, some of the some of

Speaker:

the tech bud buzzwords. But if you have any specific questions, maybe they'll jog jog

Speaker:

my memory. I don't know. Like, one of the things that was popular about that

Speaker:

time was, HBase. Oh,

Speaker:

yes. I, were we using interest? I think we were using,

Speaker:

Yeah, I remember that Term, but I I know. I know. I was as you

Speaker:

were talking, I'm like using it or if we wanted to use it. It was

Speaker:

one of those 2. Now from that if memory serves, I think Facebook is the

Speaker:

one who pioneered h base because it was really it was a right once read

Speaker:

many thing, and basically, the last one in when,

Speaker:

last last I can't talk, sorry. Last one wins.

Speaker:

Let Andy help might help me out if with the

Speaker:

technical term for that last one. Last one wins. What's the

Speaker:

Oh, yeah. So right. I remember they were called h files, so it must have

Speaker:

been yes. It must have been that. Yeah. Yeah. Yeah. That was one

Speaker:

of those sorry, Andy. Go ahead. That's okay. You were I was thinking,

Speaker:

you know, ChipLogic last in first out. Yeah.

Speaker:

Yeah. Something like that. Yeah. Know if that's what you were after or not. Last

Speaker:

one wins. Last one That was their concurrency strategy.

Speaker:

That's, I know there's a better term for that, but again,

Speaker:

it's a Monday and, I have

Speaker:

a headache. But no, it's it's it's it's

Speaker:

fascinating to kind of Almost like technology

Speaker:

archaeology. Like what worth the big projects

Speaker:

that were popular at the time. Right? You know, and it's

Speaker:

just, And it's scary to think that, you know, we're talking 10 years

Speaker:

ago. I mean, I mean, you Not even though. A lot of this stuff was

Speaker:

I mean, a lot of this stuff is probably still in place at Foursquare today.

Speaker:

Yeah. I mean, what's interesting to me is you you mentioned a lot of

Speaker:

the NLP, techniques that, You know, for lack

Speaker:

of better term, people would consider legacy now, right? Because they're pre

Speaker:

transformers, right? They're pre GPT, right? Sentiment

Speaker:

analysis, a lot of, you know, I I speak with a lot of people with

Speaker:

varying degrees of technology skills, and they

Speaker:

assume that this field of research didn't exist prior to

Speaker:

last year. And, very

Speaker:

much not the case. It's just that radically changed about a year ago.

Speaker:

Right. I mean and and this is something that I'm trying to figure out how

Speaker:

to do, which, I I might not be alone. It's like, okay. I did

Speaker:

all these things. How do I reinvent myself now in this new world?

Speaker:

And, you know, once you realize it could be exciting thing, then

Speaker:

it's maybe not so much of a drag, you know, because there's there's so many

Speaker:

opportunities out there. But it's like, but I can't be

Speaker:

the only one out there who's struggling with this being like, okay,

Speaker:

wow, I've got a, You know, I I've gotta,

Speaker:

you know, work or at least do projects for companies that

Speaker:

are at the cutting edge here in order to, In order to be,

Speaker:

you know, in order to be at the forefront. Yeah. It's funny, like, you miss,

Speaker:

like, I was, You know, offline for I tried to be offline, but for the

Speaker:

better part of a week and for vacation.

Speaker:

And like During that week, AMD announces that they are

Speaker:

producing their own, GPU LLM

Speaker:

type hardware. Gemini comes out and all

Speaker:

these other innovations that come out, and I'm like, I feel, like, hopelessly behind

Speaker:

now. I'm being offline for a week. Yeah.

Speaker:

Yeah. It's I mean, I I guess the

Speaker:

only, consolation there is everyone's dealing with that.

Speaker:

Right. You know? Yeah. Right. And Kinda like impostor

Speaker:

syndrome. Right? Yeah. Yeah. I think I think the

Speaker:

question is, especially in this new world of generative

Speaker:

AI. And and the question I'm asking I don't necessarily have answers. But it's

Speaker:

like, how do you so You wanna jump in the stream

Speaker:

and get all the latest stuff, but you also want to leverage your experience

Speaker:

and understanding. Cannot be leveraged. And

Speaker:

so what's the best way to to, you know, what

Speaker:

what's the best way to balance that? I think that's something that I would like

Speaker:

to see more people asking. And I would like guidance on this. I know I'm

Speaker:

the guest. I'm supposed to say what I know, but No. But try now. You

Speaker:

know, Mads, the, the thing is a lot of the stuff that's

Speaker:

That's new. I'm doing the air quotes here for people who are listening.

Speaker:

A lot of things that are new are really coming out of tech That was

Speaker:

developed. The math was developed, for instance, in the late sixties, seventies,

Speaker:

eighties, nineties. So a lot of that is just being reapplied

Speaker:

Back when the math was developed and the theorems and and such,

Speaker:

we didn't have machines fast enough to do it or at least do it

Speaker:

usefully. So I wouldn't feel bad at all about,

Speaker:

you know, having a bunch of, a bunch of experience that seems dated

Speaker:

right now because A couple of weeks to a couple of months. That might

Speaker:

be the new shiny. Right. That's true. When I went back to

Speaker:

a a university computer science program, You know, they're still studying

Speaker:

data structures and algorithms. It's still very relevant.

Speaker:

And, you know, I think a lot of outsiders think, oh, everything's

Speaker:

gonna Turnover in, in a year and a lot of things

Speaker:

do. But there are also a lot of kind of like universal,

Speaker:

kind of, there's a lot of universal Theory that's, good to know

Speaker:

about. Sure. The fundamentals don't change that often.

Speaker:

Nope. And it's a lot of reapplications. I see a lot of people

Speaker:

reapplying stuff 2 or 3 times. I mean, I've been I've been

Speaker:

around computing since 1975. So I've seen kinda like these meta

Speaker:

patterns flow, you know, through several generations,

Speaker:

and they kinda keep just resurfacing. One of one of the

Speaker:

interesting ones is, like, the, well,

Speaker:

both the chatbot and the text based interface versus the,

Speaker:

Graphical interface seems like we keep going back and forth. You know,

Speaker:

I I remember chatbots back in the, you know, AOL days.

Speaker:

AIM days probably way before that too.

Speaker:

And then, you know, and then There was kind

Speaker:

of a a a chatbot resurgence in, you

Speaker:

know, 2016, 2015, whenever when every company wanted

Speaker:

a chatbot and we're excited about that. Yeah. It didn't quite work. It

Speaker:

seemed to fizzle out. Then, you know, the

Speaker:

the then nowadays, we have So many chat interfaces,

Speaker:

chat GPT and and generative AI seems to be resurgent again.

Speaker:

So there are these weird sine waves, these weird

Speaker:

cycles, and I almost think of it as a coil where, you know, you're starting

Speaker:

at the bottom and you're cycling, but you're also moving up at the same time.

Speaker:

And so How do you how do you surf the wave? That's, that's,

Speaker:

something that's once you kind of, understand the

Speaker:

fact that that's what you're doing, then then then you can be excited about it.

Speaker:

I I think it's fair. Well, we're at that point in the show

Speaker:

where we transition to our, questions. And, we

Speaker:

dropped them into the chat here for you. Our very first one is how did

Speaker:

you find your way into data? Did data find you or did you find

Speaker:

data, Max? Interesting. Well, I

Speaker:

guess I was always interested in math and computer

Speaker:

science. You know, going back to undergrad, you

Speaker:

know, it was like there was a lot of different areas I could choose. I

Speaker:

had a hard time going into a field that, you know, where I

Speaker:

wasn't, using all different parts of my brain and

Speaker:

computer science department, it was it was not just the

Speaker:

mathematics. It was, you know, there was, you know,

Speaker:

there was a bunch of creativity in it as well. There was human computer interface.

Speaker:

There was it. So, So I was kind of, I gravitated to that field

Speaker:

as an undergrad. When I graduated, I I joined a company

Speaker:

called wireless generation, which, today is called Amplify.

Speaker:

And that's it was an education tech company. And I was

Speaker:

doing, you know, some simple kind of software engineering work. Actually, back

Speaker:

then, It was, which sounds really dated now, but, you know, they

Speaker:

were probably doing this up to, like, 2010, which was, you know,

Speaker:

writing c plus plus for the palm pilot. You know, we yeah.

Speaker:

Because it was they were assessing students and then it would sync to to

Speaker:

the web and all that. And Sure. It was a lot of, like, taking

Speaker:

stuff, Taking that information out of databases and putting it into a a

Speaker:

dashboard. And it was it was you know, I I felt like there

Speaker:

could be something more interesting I was doing even though I love kind of the

Speaker:

mission of that company there. So I ended up in grad school. I ended up

Speaker:

at NYU and I went there from I guess

Speaker:

2009 to 2011 really discovered,

Speaker:

you know, data mining, was the 1st related

Speaker:

class. Then I took, You know, machine learning, natural language processing. Actually,

Speaker:

the the machine learning class was with, Jan Lacun, who is,

Speaker:

a very well known machine learning researcher. He's Like The Lani.

Speaker:

The the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the the the the the

Speaker:

the the the the the the the the the the the You know, all the

Speaker:

stuff that exists today. Like, even this was 2010. He would show us a camera

Speaker:

where he would point to different objects. He'd be like key, wallet,

Speaker:

chair, and it would like, the the the text would appear

Speaker:

on the the screen based on what he pointed at. So they knew how to

Speaker:

do all this stuff, that that you think of as as kind of

Speaker:

it it's it almost seems crazy that that was not, like,

Speaker:

and and turned into a product that anyone could use back then that it almost

Speaker:

seems crazy that it took you know so long to do it but they and

Speaker:

actually it it may have been Used by someone.

Speaker:

It's, sure. Maybe we just don't know about it.

Speaker:

Sorry. My paranoia. No. No. You're right. I'm sure it was used quite

Speaker:

a bit, but it it it's just like what it was that kind

Speaker:

of Sitting on his laptop was so much more sophisticated than anything that

Speaker:

that I I saw a year later. But,

Speaker:

Yeah. So it was That was kind of inspiring. And so it was

Speaker:

like, you know, it was

Speaker:

to me, it seemed like a much more interesting problem. Well, how do you How

Speaker:

does the machine learn? You know? How do you, you know, I don't I don't

Speaker:

wanna sit around writing code that's just dead. I want it to To

Speaker:

be alive, I wanted to to learn from experience. And so when you dive into

Speaker:

that question, well, then you get into machine learning, which is actually Pretty well

Speaker:

named. And then and, you figure, okay, well, you need

Speaker:

data to learn from, and then that that ends up being a statistical model

Speaker:

and so on and so forth. So, you know, when I

Speaker:

so Foursquare, was a company that that essentially came

Speaker:

out of NYU And, you know, it kind of intersected. So

Speaker:

and and they wanted to, to learn from from

Speaker:

their data. They wanted to kind of, sort of a

Speaker:

to build a data science team. And so I had already been

Speaker:

working on that sticky map project, And I was into local search. I

Speaker:

loved the the product aspect. I didn't have my new

Speaker:

interest in machine learning and LP in there. So it all kinda came together. And

Speaker:

so that's why I think that was such a good fit for me and probably,

Speaker:

probably would be very difficult to find such a fit again.

Speaker:

Our next question is, what's your favorite part of your current

Speaker:

gig? And that was, in The virtual green room, you said you

Speaker:

kinda had a good story about that.

Speaker:

Right. So I don't. Well, I don't exactly have a a

Speaker:

current gig right now. I have a bunch of different projects that I'm working on.

Speaker:

It was you know, I think It it was on one hand, it was

Speaker:

nice in Foursquare to be able to focus on one thing, and I'm gonna come

Speaker:

back to that. But I feel like you need these periods, almost like the same

Speaker:

as the grad school period That I had, back in 2010 where it was

Speaker:

like, well, you're working on a few different side projects, but let's see.

Speaker:

Hopefully, like, eventually it'll coalesce into something,

Speaker:

you know, something a little bit more long term and permanent. So I'm working on

Speaker:

several projects. One is with with the Foursquare founder, Dennis

Speaker:

Crowley. And we are Working on a new product, a new

Speaker:

kind of city guide where you walk around the city with your headphones in,

Speaker:

with your AirPods in or whatever. And We kinda know what you're passing,

Speaker:

by. We sort of are are using some of the Foursquare

Speaker:

tools that are publicly available that we know about, but also, You know, we're

Speaker:

kind of rigging up our our own thing because we've just done it so many

Speaker:

times. You know how to do it. We're okay. We know what

Speaker:

stores and stuff you're walking past. So what kind of sounds can we play? Right

Speaker:

now, it's a bunch of text to speech. Essentially, the way I've rigged it up,

Speaker:

the the old version 0, the alpha version is, you know,

Speaker:

we asked chat g p t or OpenAI API what to say. So it's

Speaker:

basically like you're you're walking down the street hearing,

Speaker:

content From OpenAI. Interestingly, OpenAI

Speaker:

seems to the the GPT seems to know,

Speaker:

stuff about Every place along the way, like, you don't

Speaker:

have to go into, like, location based database.

Speaker:

It seems to seems to know quite a bit. There is a question of the

Speaker:

all the content is there's some interesting content in there, but it all ends up

Speaker:

being kind of mediocre. So it's like, okay, well, how do we turn this into

Speaker:

something really cool? I think, you know, in the end, having,

Speaker:

you know, you know, maybe music and and and speeches and an art

Speaker:

project somehow in there, based on where you walk is an interesting

Speaker:

idea. So if I could That'd be cool. Yeah. I could be like a

Speaker:

platform that people can use, like a cultural version of

Speaker:

Foursquare. Yeah. Yeah. And or maybe it's just

Speaker:

like an enhancement of the the sounds of the city. Or maybe

Speaker:

it's, You know, I mean, a lot of people think, okay, maybe maybe a tour

Speaker:

guide. I I don't know. But, you know, it it's it feels like,

Speaker:

It feels like there needs to be, a variety

Speaker:

of use cases tried because there's there's a lot you could do with it. And

Speaker:

and Maybe, you know, if if you put this in the hands of more

Speaker:

creative or of of additional creative people, they would,

Speaker:

ultimately find something interesting. I'm also working

Speaker:

yeah. Oh, I could answer questions about that. But then my other project is my

Speaker:

other 2 projects are are kind of interesting as well. Well, I have the

Speaker:

podcast, The Local Maximum. So still doing that every week and, you know,

Speaker:

interviewing people. Talking about,

Speaker:

talking about data, talking about AI, you know, few episodes on the

Speaker:

whole. You know, all the drama around OpenAI recently.

Speaker:

I I never wanted to become kind of the the the,

Speaker:

the the the tech drama, you know, what's it called?

Speaker:

TMZ of technology? Yeah. Yeah. But but that's something that happened because

Speaker:

I remember, like, last year, a couple years ago, there was all this craziness coming

Speaker:

out of Google with, You know, there was 1 guy at Google who said,

Speaker:

you know, he thought that the LLM has come to life. And Oh, yeah. And

Speaker:

then and then there was a there was A whole

Speaker:

bunch of stuff with, like, the the AI safety, you

Speaker:

know, seemingly staffed

Speaker:

by, people who are a little nutty. And

Speaker:

so, it was a And they fired a bunch of

Speaker:

people From that team too. So, like, there's

Speaker:

definitely, it was something weird some weird mojo

Speaker:

was going around. That's for sure. Yeah. And when when I cover that, I

Speaker:

mean, it's hard to, you know, it's hard to hide the fact where it's like,

Speaker:

wow, everyone in this story seems kinda nutty. But I also try to, you know,

Speaker:

I try to take a step back and say, okay, this is what we know.

Speaker:

These are a few things that could be happening internally, but we don't know everything.

Speaker:

I'm not gonna jump to conclusions. But,

Speaker:

I I I I try when I'm covering a story in a local maximum to

Speaker:

give, like, a a balanced, a balanced version of

Speaker:

of whatever story I come across. You know, maybe it's my show as I try

Speaker:

to give my opinion. But, yeah, I I

Speaker:

I my attempts, which, you know, some people have have,

Speaker:

said I I I've captured that. But my my attempt is to sort

Speaker:

of, try to try to

Speaker:

approach each Story with a little bit of humility and try

Speaker:

to help people understand what's going on without the

Speaker:

raw emotion that you get often on on Twitter. Gotcha. That's a good

Speaker:

point. Yeah. So we have, go ahead. I'm

Speaker:

sorry. Oh, no. No. It's okay. Go ahead. Okay. So we got,

Speaker:

3 complete dishonest. And, the first is when

Speaker:

I'm not working, I enjoy blank. Right.

Speaker:

So now that I've moved to Connecticut, I feel like I

Speaker:

am such a a Connecticut stereotype where I kinda, like, Drive

Speaker:

around, going for walks in the woods and into

Speaker:

various malls and stuff. So it's like it's like it's When the

Speaker:

weather's good, you go into the woods. When the weather's bad, you go into the

Speaker:

mall. Yeah. So I I actually like enjoy doing

Speaker:

that. I enjoy listening to podcasts.

Speaker:

I, honestly, enjoy hanging out with friends. You know,

Speaker:

after, I used to live in New York City. I enjoyed

Speaker:

it a lot, and I sorta had this, situation where

Speaker:

I had this be careful what you wish for because, at the end of 2019,

Speaker:

I was like, oh my god. I'm going to, like, events every single day. It's

Speaker:

just it's just too much. How can we, like, how can we, you know,

Speaker:

cut back on that. And then COVID came. And then to me, it was just

Speaker:

it was the worst thing because it was like, okay, you stay in your apartment

Speaker:

in New York City all day and you don't go and and talk to anyone.

Speaker:

And it was just like it it it was just awful. It just

Speaker:

felt like a a prison. So I I

Speaker:

moved to New Hampshire for a couple years, then I came back. But, you

Speaker:

know, nowadays, when I get a chance to hang out with with friends and and

Speaker:

family, I just I try to do it, whenever I can because I'm

Speaker:

not like, you know, it's not like when I was living in New York in

Speaker:

the 2010s and got kind of overload on that.

Speaker:

Right. Right. So, yeah. That's my answer there. And we have another complete this

Speaker:

sentence. I think the coolest thing in technology is blank.

Speaker:

The the way I've been putting it recently Is this,

Speaker:

where, you know. It.

Speaker:

You know, back maybe 10 years ago, the story we

Speaker:

were getting that the hopeful story we were getting was that, okay, if you're an

Speaker:

engineer, you could Build anything you want at

Speaker:

a very low cost or if you're not an engineer for anyone because we

Speaker:

have access to social media. You know, you can,

Speaker:

you can put anything out there into the world that you want

Speaker:

and and have people read it if if if they want to, or have people

Speaker:

look at it if they want to. And so that was kind of the new

Speaker:

exciting world. I think today, The new exciting

Speaker:

world goes well beyond that, which is going to be like,

Speaker:

you you can create worlds. Any Any world that you wanna

Speaker:

build, any scenario that you can imagine, you

Speaker:

can just have a machine fill in all the gaps for you and, You

Speaker:

know, write the write the story, make the

Speaker:

image and maybe, like, you know, make the make the video, make the whole

Speaker:

world. So I think, I I I think the

Speaker:

idea with generative AI that I want people thinking about more that that I

Speaker:

I also wanna think about more is, like, Okay. If you could create any world

Speaker:

you want, to explore, to live in, just

Speaker:

to, you know, maybe it's something to to teach us about something. Maybe it maybe

Speaker:

it's just an artistic adventure

Speaker:

venture, you know, what kind of world do you want to create because that's

Speaker:

that's going to become very cheap very quickly. Yeah. I could

Speaker:

see that. So I'm gonna skip to,

Speaker:

share something different about yourself. But we remind our guests

Speaker:

to remember it's a family podcast.

Speaker:

Okay. And I'm, you know, I'm I'm trying to,

Speaker:

I'm I'm trying to think of an answer here, and it's not because,

Speaker:

it's it's not because of the, of the of the caveat there,

Speaker:

but it No. No. I get it. Well, you've already covered a lot

Speaker:

that's It's different. I did. It is good stuff. Yeah. I mean, I

Speaker:

think, I think one thing that, It's,

Speaker:

I I enjoy doing that that that I forgot to mention, because

Speaker:

I'm actually doing it again for the 1st time in in in 6 years was,

Speaker:

I was a member of the Yale Alumni Service Corps. It's not

Speaker:

a member. It's like you can do a a it was essentially we were doing

Speaker:

trips to underserved Communities around the world and,

Speaker:

you know, doing little, like, kinda, kind of service

Speaker:

trips where you'd ever Either build a structure or work with small business

Speaker:

owners or go, you know, teach in a school. And so I've been

Speaker:

to, Nicaragua and Ghana And I actually

Speaker:

got to lead one of their trips in 2017 and that was to the Fort

Speaker:

Mohave Indian reservation. Very different kind of a trip because

Speaker:

it was Within the United States here. And

Speaker:

so it was honestly a lot easier. Because that's very cool

Speaker:

flying to Vegas. But yeah. But we're actually going back there, in in a few

Speaker:

months after 6 years. And so I was so even though it was less

Speaker:

convenient this time around, I'm I'm very excited Do that. And so

Speaker:

I I don't know. I really like learning about different cultures,

Speaker:

different philosophies, different religions. I think A lot of people might

Speaker:

assume given the you know given the tenor of my

Speaker:

podcast that I'm very like you know rationalist and I talk about Bayesian

Speaker:

inference a lot. But, I I

Speaker:

sort of venture out of that a lot. I don't think that,

Speaker:

That raw math can, can explain everything in life. And I also love like the

Speaker:

diversity of, of cultures and stuff. So No. It's cool.

Speaker:

That so that's maybe a positive thing, so I I don't know. It's very positive.

Speaker:

Something different. No. It definitely is. It definitely is. So where can

Speaker:

folks find out more about you and what Sure. So you mentioned you have a

Speaker:

podcast, which I love the name, The Local Maximum.

Speaker:

Right. Yeah. Local maximum's triple entendre, because it's got my name, Max.

Speaker:

It's a local maximum is, of course, you know, in machine learning,

Speaker:

when you're when you're trying to find the, well, sometimes it's often the local minimum

Speaker:

if you're trying to, Minimize the the loss function, but the

Speaker:

in basing inference, if you're trying to maximize the probability, whatever, you're you you get

Speaker:

stuck stuck in one of your next which was Your name is Max. So Yeah.

Speaker:

Exactly. Right. Right. That's the first one. That's the second one. And then, you know,

Speaker:

I I worked on location data a lot, so it's kind of a a triple

Speaker:

meaning. And so, I've

Speaker:

been doing that for for quite a while. You can go back into kind of

Speaker:

a a really extensive library there. And, I have

Speaker:

the website local max radio.com. I have.

Speaker:

If you go, I have local maximum labs. If you go to local maximum local

Speaker:

max radio.com/labs, I have a

Speaker:

bunch of papers And, you know, kind

Speaker:

of works that I've done, which, you know, includes some

Speaker:

discussion of machine learning, like kind of the math mathematics behind bias correction,

Speaker:

but But also something kind of fun that I did, like, with the podcast last

Speaker:

year, which is, like, I just rewrote the US constitution, fixed a bunch of

Speaker:

things just because I I felt that was fun. I was Taken aback by how

Speaker:

mad people get when you when you do that, it's not like I was actually

Speaker:

trying to, you know, run a political campaign for it. I just thought it was

Speaker:

a a fun project, and I learned a lot. But Some once you venture into

Speaker:

the political, people start treating things different. People get angry pretty quick.

Speaker:

Yes. That's true. Yeah. Yeah. I I I The

Speaker:

I I I love to hear criticisms on it. I wanna hear what what what

Speaker:

people think. But The one criticism

Speaker:

that I get a lot, which I really hate is, like, how dare you spend

Speaker:

your free time on this, which I I just don't get at all.

Speaker:

Yeah. But, which, you know, whenever I put

Speaker:

out some kind of math paper, even if it's like and there there is one

Speaker:

called relative probability, which is, you know, sort

Speaker:

of an abstract paper where it's like, okay, a reimagined probability

Speaker:

theory as, okay, let's say you can't talk about The probability of something

Speaker:

happening. Let's say you can only talk about 1 probability relative

Speaker:

to another. What, what does that look like? And I just stated some basic facts

Speaker:

and, you know, Not that many people gonna use it. Maybe people

Speaker:

won't use it for for a while. I I feel like it's an interesting idea,

Speaker:

and I feel like it will have uses eventually. But,

Speaker:

You know, nobody criticized me for that. For like, how dare you spend your free

Speaker:

time on that? Exactly. They they pick on you for the other

Speaker:

stuff. Yeah. I mean, I I look at people. I mean, you spend your free

Speaker:

time yelling at people on Twitter. I mean, what's the difference? I was gonna say

Speaker:

you can you can look at TikTok, and you can find far more destructive uses

Speaker:

is of Exactly. Exactly. So that's so that's that's my

Speaker:

main thing. I I think maybe with the, with the Constitution, I think people have

Speaker:

their sort of ideal society in mind. And if If your thing doesn't wind up

Speaker:

with that, they they kind of perceive you as a threat. Like you're trying to,

Speaker:

like I was trying to revitalize democracy, but some people are saying, no, you're

Speaker:

backsliding on democracy. Alright. Like, let's talk about it. But, yeah, it's it's

Speaker:

people get you know, people get different. We need to have you back and talk

Speaker:

about that more. Yeah. For sure. For sure. Talk about that. Absolutely.

Speaker:

We'd love having you. Both Andy and I, however, do have a hard stop, and

Speaker:

I would love this This covers you to go out for a couple hours, and

Speaker:

we'll talk to you more. And I just had a a conversation last night with

Speaker:

my cohost that went a couple hours. I know how it goes. Yeah. Yeah. Yeah.

Speaker:

We ended at 1 AM, and I was like, oh my god.

Speaker:

Well, those 1 AM conversations. I know what you mean. You got it. So

Speaker:

With that, we'll definitely make sure. Send us all your links, and

Speaker:

we'll make sure we get them in the show notes, and we'll let Bailey, our

Speaker:

semi extension AI host, Co host, 3rd

Speaker:

host, wrap up the show. And thank you, dear

Speaker:

listener, for subscribing to our podcast. You

Speaker:

have subscribed to us, haven't you? Once you do,

Speaker:

please be sure to rate and review our podcast on iTunes, Stitcher,

Speaker:

or wherever you subscribe to us. Having high ratings

Speaker:

and reviews helps us improve the quality of our show and rank us more

Speaker:

favorably with the search algorithms. That means more

Speaker:

people listen to us, spreading the joy. And,

Speaker:

can't the world use a little more joy these days?

Speaker:

So, go do your part to make the world just a little better and be

Speaker:

sure to rate and review the show.