Speaker:

You found the backup wrap up your go-to podcast for all things

Speaker:

backup recovery and cyber recovery.

Speaker:

In this episode, we're covering forever incremental backup technology and

Speaker:

how that completely revolutionized the entire backup industry.

Speaker:

I. Remember the old days of doing weekly fos that brought

Speaker:

your systems to their knees?

Speaker:

Well, those days are thankfully gone for most people.

Speaker:

We're gonna walk through the evolution of, from traditional approaches with

Speaker:

fools and incrementals to synthetic fos, and finally to true forever incremental.

Speaker:

If you're still doing things the old way, well, you're missing

Speaker:

out on some serious benefits.

Speaker:

Better performance, lower storage costs, and way faster restores.

Speaker:

Perhaps this episode will cause you to rethink things a little bit.

Speaker:

By the way, if you don't know who I am, I'm w Curtis Preston, AKA, Mr. Backup,

Speaker:

and I've been passionate about backup and recovery for over 30 years, ever since.

Speaker:

I had to tell my boss that we had no backups of the production

Speaker:

database that we had just lost.

Speaker:

I don't want that to happen to you, and that's why I do this.

Speaker:

On this podcast, we turn unappreciated backup admins into Cyber Recovery Heroes.

Speaker:

This is the backup wrap up.

Speaker:

Welcome to the show.

Speaker:

Hi, I am w Curtis Preston, AKA, Mr. Backup, and I have with me a guy that

Speaker:

I bet is pretty dang happy right now.

Speaker:

Prasanna Malaiyandi.

Speaker:

Why are you looking like that?

Speaker:

very confused about what I'm happy about.

Speaker:

Really?

Speaker:

Do I need to explain recent events that happened at your house

Speaker:

Oh

Speaker:

that, that.

Speaker:

For those of you by the way, um, who are listening to this podcast via

Speaker:

audio, we also provide video on YouTube.

Speaker:

So if you wanted to see our expressions just then and really see what Curtis

Speaker:

is laughing about, uh, go check us

Speaker:

Yeah,

Speaker:

Backup wrap up.

Speaker:

so what events happened in the last 48 hours?

Speaker:

my wife was in India

Speaker:

Yeah.

Speaker:

Visiting family and she just got back.

Speaker:

Yes.

Speaker:

it was good.

Speaker:

Suddenly, you're so much harder to get on the phone between that and your new job.

Speaker:

I don't even know you anymore.

Speaker:

He's like, why would I be happy?

Speaker:

I, I was really, I was, I thought it was like, I was thinking, I was like,

Speaker:

was it something YouTube related?

Speaker:

Was it something like I, we had watched or we had talked about and.

Speaker:

No, the fact that your wife is back

Speaker:

although she's also slightly jet lagged, so,

Speaker:

so well, it's what, what, what's it is 12 hours difference.

Speaker:

right now it is 13 and a half.

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

So

Speaker:

does

Speaker:

of course she's,

Speaker:

jet lagged than I do.

Speaker:

So

Speaker:

yeah.

Speaker:

and come back, it takes me like three weeks.

Speaker:

And,

Speaker:

Yeah.

Speaker:

have a dog, which I think some people have seen on the podcast.

Speaker:

Um, it's funny because he gets jet lagged when I'm jet lagged.

Speaker:

So he, his hours will shift to match with mine.

Speaker:

So

Speaker:

That's really adorable.

Speaker:

and he would be like, Hey, I'm ready.

Speaker:

Let's go, let's go.

Speaker:

And then when I'm passed out, he's asleep.

Speaker:

So he's

Speaker:

Interesting.

Speaker:

Yes.

Speaker:

Your dog that's named after an Indian dessert.

Speaker:

cream.

Speaker:

Yep.

Speaker:

Ice cream.

Speaker:

Oh yeah.

Speaker:

Ice cream.

Speaker:

Yeah.

Speaker:

Kulfi, which I have now had, um.

Speaker:

your, did you know they have various flavors of Kulfi.

Speaker:

They're not all just like one.

Speaker:

Oh, I thought it was all mango.

Speaker:

So it's met like mango Kulfi and then what?

Speaker:

Coconut Kulfi.

Speaker:

What are what?

Speaker:

What are the other typical

Speaker:

and pistachio.

Speaker:

Pistachio and, and.

Speaker:

I think there are other ones too.

Speaker:

Yeah.

Speaker:

I don't, yeah, I don't.

Speaker:

I've used cardamom as a, an ingredient, you know, when I made the, the

Speaker:

mango pie thing, but I don't think I know what it tastes like by itself.

Speaker:

So I'll have to, I'll have to sprinkle some and, uh, see what that tastes like.

Speaker:

Or maybe just go and

Speaker:

Yeah.

Speaker:

some

Speaker:

next time you're at the Indian store.

Speaker:

Actually, sometimes even the Costcos carry like a sampler pack

Speaker:

Oh, yeah, so like when I go to the cash and carry down in Mira Mesa.

Speaker:

the cash and carry will definitely have it.

Speaker:

If you go check out the freezer section.

Speaker:

Just

Speaker:

Interesting.

Speaker:

all before you get home.

Speaker:

Yeah.

Speaker:

I, oh man.

Speaker:

I'm, you're, you're literally like, like my mouth is Yeah, yeah.

Speaker:

Thinking about, thinking about it sounds yummy.

Speaker:

Hmm.

Speaker:

Uh, so we're gonna talk about something that's just as yummy.

Speaker:

That is if you're into, into the backup thing, uh, we're gonna talk

Speaker:

about forever incremental and sort of.

Speaker:

Uh, just how big of a deal that is or was.

Speaker:

Um, it, at, at this point?

Speaker:

It's in, in most modern backup products that have come out in

Speaker:

the last 10 years, I'd say, uh, ones that are actually new, right?

Speaker:

So, you know, when we look at the products like Rubrik, Cohesity, or

Speaker:

modern SaaS applications like Druva, Hyku, keep it, Alcion, uh, right.

Speaker:

You know, these are all of these products.

Speaker:

Have forever.

Speaker:

Incremental in common, right?

Speaker:

Nobody wants to do full backups anymore.

Speaker:

Uh, or, or anything like full backups, right?

Speaker:

So, um, so let's talk about that.

Speaker:

Let's talk about what, you know, where were we, what was it like back in the

Speaker:

day, and, um, and then sort of how we got the, from there to here and why it's

Speaker:

such a big deal and why you should care.

Speaker:

Right.

Speaker:

And why this perhaps may give you a reason to finally, um, you

Speaker:

know, what, what is it, shuck?

Speaker:

The mortal coil.

Speaker:

What, what, what, what is that?

Speaker:

What,

Speaker:

I'm just

Speaker:

thinking are there company, and maybe as we're talking we can figure this out, like

Speaker:

are there still vendors that exist that don't support forever Incrementals?

Speaker:

That's actually a really good question and I'm gonna say, uh, absolutely

Speaker:

right, because my concept of what a true forever incremental will be based on.

Speaker:

Like a, a complete redesign of the, of the underneath.

Speaker:

Right.

Speaker:

Um, so the phrase I was looking for is shuttle off sh shuffle off the

Speaker:

mortal coil of your old backup product,

Speaker:

by the way, wait, quick story.

Speaker:

So I'm on, I'm on a plane one time and this, this, um.

Speaker:

This pilot, he was talking about the fact that we were about to take off and

Speaker:

he referred to taking off as shuffling off the mortal coil of this earth.

Speaker:

And I'm like, dude, that's a phrase for dying.

Speaker:

What?

Speaker:

Why the hell are you, why would you do that?

Speaker:

Right.

Speaker:

, it's a Shakespeare, it's from the to be or not to be soliloquy

Speaker:

I,

Speaker:

Hamlet.

Speaker:

in all fairness though, I would say 97% of people probably had

Speaker:

no idea what that phrase meant.

Speaker:

They, they never even heard.

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

And, and, and, and, and just to, just to, uh, like, you know, since I'm over

Speaker:

here quoting Shakespeare, let me just put this, I'm not a huge Shakespeare person.

Speaker:

In fact, it, it wasn't until, I dunno, several years ago, and we were in England

Speaker:

where I, I, I got to actually see my first, um, uh, Shakespeare play, which

Speaker:

was Henry V. Some people say Henry the VI don't, I think it's actually

Speaker:

called Henry five, which is the, um.

Speaker:

The one with the St. Crispin stay speech.

Speaker:

Uh, once more into the breach, close the wall up with, with

Speaker:

her English dead and all that.

Speaker:

But anyway, sorry, I just bored everybody.

Speaker:

Sorry.

Speaker:

Sorry, you're asleep.

Speaker:

Wake back up.

Speaker:

Wake back up.

Speaker:

We're, we're gonna talk about backup, which is so much more

Speaker:

exciting than Shakespeare.

Speaker:

Um, so, all right, so first let's talk about, do you want to define, uh, so what?

Speaker:

let's talk about the old days first.

Speaker:

that's what I, that's what I'm saying.

Speaker:

That's what I'm saying.

Speaker:

Let's, do you wanna define, uh, what a full and, and

Speaker:

Yeah.

Speaker:

an incremental backup is?

Speaker:

Yeah.

Speaker:

So.

Speaker:

Normally when you think about backups, right, you talk,

Speaker:

typically people talk about fulls.

Speaker:

Incrementals, right?

Speaker:

A full is basically your entire application, dataset, whatever

Speaker:

it is, everything is backed up.

Speaker:

Mm-hmm.

Speaker:

then between doing every fulls, you would do incrementals, which are just the things

Speaker:

that have changed since that last full.

Speaker:

Right, right.

Speaker:

you do these periodically, and normally you do incrementals

Speaker:

because they will be smaller.

Speaker:

So you could say do a full once a week and then incrementals during the week

Speaker:

such that you can finish in time or during your backup window, and then the next

Speaker:

weekend you would do like another full,

Speaker:

Right, right now, slightly, I'll slightly change your

Speaker:

definition or a pen or whatever.

Speaker:

Um, so you specifically described what I would call a

Speaker:

cumulative incremental, right?

Speaker:

So a, uh, what is referred to as a sort of a regular, incremental

Speaker:

or a differential incremental.

Speaker:

Yep,

Speaker:

it is one where it's the changes just since the last incremental or since

Speaker:

the last backup of any kind, right?

Speaker:

And then you

Speaker:

Um,

Speaker:

levels and all the

Speaker:

yeah.

Speaker:

Yeah, yeah,

Speaker:

But yeah, at a high level, yeah, there are those three types, if you will.

Speaker:

yeah.

Speaker:

And, and, and then what really, um, what really what it really

Speaker:

came down to is how things behaved when it came time for a restore.

Speaker:

Right.

Speaker:

So if you were going to restore something, then you needed the full,

Speaker:

the latest cumulative, incremental, if you were doing such a thing, and

Speaker:

then any incrementals since that day.

Speaker:

And so, um, those of us, uh.

Speaker:

I, I remember at some point we, we used to do weekly fulls.

Speaker:

I remember my preferred backup design switching to monthly fulls,

Speaker:

weekly cumulative incrementals, and then daily incrementals.

Speaker:

So if you were, let's say three weeks into the month, you would need the full

Speaker:

from the first of the month, the latest, uh, weekly cumulative, incremental,

Speaker:

and then three or four days worth of.

Speaker:

Incrementals

Speaker:

Yeah,

Speaker:

and, uh, in order to do a restore.

Speaker:

. So what it meant was that you would often, the way the Restore worked is you

Speaker:

would restore everything that was in the full, then you would restore everything

Speaker:

that was in the cumulative incremental, and then you would restore everything

Speaker:

that was in each of the incrementals.

Speaker:

What that meant was there were many files that you were actually restoring

Speaker:

multiple times and you were overwriting them, which was just inefficient.

Speaker:

Um, and what that also meant was.

Speaker:

That you had, uh, uh, duplicate data, the old way of doing backups created

Speaker:

a ton of duplicate data on the tapes.

Speaker:

You could say duplicate, or you could also say it's independent,

Speaker:

depending on the glass.

Speaker:

Half full glass, half empty.

Speaker:

Well, so no, I, I understand what you're saying.

Speaker:

What I'm talking about is the fulls,

Speaker:

Yeah,

Speaker:

the fulls over time created a ton of duplicate data.

Speaker:

that's true.

Speaker:

Right.

Speaker:

I know what you're thinking.

Speaker:

You're thinking, you're talking about the, within the, within the, the incrementals.

Speaker:

I'm mainly talking about

Speaker:

yeah,

Speaker:

how many different copies of the exact same file were all

Speaker:

over, you know, everywhere.

Speaker:

and especially because most times, right, you look at sort of like a

Speaker:

2% daily change rate, which maybe

Speaker:

Yeah.

Speaker:

a seven or 8% change rate per week.

Speaker:

And

Speaker:

Yeah.

Speaker:

those weekly cumulatives or even the monthly fulls, right, you're

Speaker:

basically backing up a bunch of data, which you don't necessarily

Speaker:

need that haven't changed.

Speaker:

Right, right.

Speaker:

You, you, the reason we did it the way we did, it wasn't so much about independence.

Speaker:

It was about.

Speaker:

Minimize.

Speaker:

So if you did a monthly full, followed by nothing but regular

Speaker:

incrementals, you would have to load 30 tapes at the end of the day, right?

Speaker:

At the end of, and even if it wasn't tapes, you would have

Speaker:

to, you have to do 30 restores.

Speaker:

Yeah,

Speaker:

So, so the, the, the, the differential or the cumulative incremental, um,

Speaker:

was about minimizing the level of effort during a restore, right?

Speaker:

And then came along this thing called dedupe.

Speaker:

Uh, which at least helped minimize, I think the, all of that duplicate data.

Speaker:

because I think with deduplication it was nice because you didn't

Speaker:

have to change what you were doing from like a policy perspective

Speaker:

Right,

Speaker:

perspective.

Speaker:

But you got, like you said, it was able to.

Speaker:

Dedupe, of course,

Speaker:

right.

Speaker:

in the backend and save you the storage costs.

Speaker:

And that's why when, if you, if you were doing a, a typical weekly, by

Speaker:

the way, nobody did the, the monthly full thing that I recommended, most

Speaker:

people did weekly fulls, right?

Speaker:

And so when you had weekly fulls, um.

Speaker:

This is why, and you, and you had like 90 day retention.

Speaker:

This is why just with even crappy dedupe, you got 20 to one,

Speaker:

Yeah.

Speaker:

uh, data reduction, right?

Speaker:

And this is what helped, um, pay for, you know, the what to help

Speaker:

created the, the market for all that, all that dedupe work, right?

Speaker:

And then we could argue over, um, you know, well we're, we're

Speaker:

20 to one, we're 30 to one, we're, you know, whatever, right?

Speaker:

And, um, we, we saved this much more money.

Speaker:

Um.

Speaker:

You know, once you get rid of all those fulls, everything else

Speaker:

was, was really kind of incre

Speaker:

Yeah.

Speaker:

incremental.

Speaker:

Um, but the real, I think the real challenge was just the core backup design

Speaker:

of when we go to do a restore, we have to restore, uh, data multiple times.

Speaker:

Right.

Speaker:

It just simply wasn't efficient.

Speaker:

Well, and I think even before we get there, I'd like to cover

Speaker:

Mm-hmm.

Speaker:

uh,

Speaker:

Okay.

Speaker:

if you had the existing backup schedule, right, where I'm still doing

Speaker:

weekly fulls and daily incrementals,

Speaker:

Mm-hmm.

Speaker:

still transferring all that data over the network.

Speaker:

Right?

Speaker:

And I think this is where client side deduplication would probably

Speaker:

come next, in my opinion, right?

Speaker:

Yep.

Speaker:

it would help you reduce the amount of data you're actually

Speaker:

sending over the network.

Speaker:

Right.

Speaker:

And that'll help save you cost as well as time.

Speaker:

Yeah, that's actually a really good point.

Speaker:

Thanks for bringing that up.

Speaker:

So the, one of the other like, like basically what you're saying that one

Speaker:

of the other, um, downsides to doing it the old way was both the transfer

Speaker:

across the network as well as the impact, uh, to the clients, right?

Speaker:

If you're doing a full, on a regular basis, um, you're, um.

Speaker:

There's a, there's a cost to that

Speaker:

Yep.

Speaker:

in both, you know, you brought up bandwidth, it's

Speaker:

bandwidth storage and compute.

Speaker:

Right?

Speaker:

Um, it's like all the things.

Speaker:

There was also a, um, sort of a, like a perfect storm brewing.

Speaker:

Yeah.

Speaker:

So we're doing fulls on a regular basis.

Speaker:

We're doing cumulative incrementals on a regular basis, and this is

Speaker:

how we were doing things, and then along came virtualization.

Speaker:

Why was that a problem?

Speaker:

Because a lot of the data was

Speaker:

I,

Speaker:

the same, right?

Speaker:

When you look across all of your VMs, they're probably spun up

Speaker:

from the same golden copy, of your

Speaker:

Well, the,

Speaker:

and

Speaker:

yeah, well, the.

Speaker:

amongst them.

Speaker:

Well, that, that's a, this is another example of, that's a great answer.

Speaker:

It's not the question I was asking.

Speaker:

It's

Speaker:

Um,

Speaker:

resource on the backend because

Speaker:

that's what I was going after.

Speaker:

Yeah,

Speaker:

and the compute, right?

Speaker:

You don't have your

Speaker:

yeah,

Speaker:

instance

Speaker:

yeah.

Speaker:

That's, that's what I was going after.

Speaker:

Right?

Speaker:

So, so that was a real problem when VMware got really popular and we just

Speaker:

kept doing the same old thing because.

Speaker:

Remember, for a long time we backed up VMware.

Speaker:

We backed up the physical, the virtual machines, just like

Speaker:

they were physical machines.

Speaker:

Yes.

Speaker:

Guest ba. We just basically installed an A, an agent and every guest,

Speaker:

and that's how we did the backups.

Speaker:

And, you know, and what happened was you'd do just randomly, you'd end up

Speaker:

doing a bunch of, at the same time.

Speaker:

Which worked fine when we were 20 individual machines, but now we're 20

Speaker:

VMs all on the same physical machine, and that just broke, like virtualization

Speaker:

just broke back up overnight.

Speaker:

Right.

Speaker:

Um, the latest thing to break back up overnight is, um, containers.

Speaker:

Yep.

Speaker:

That's a whole other thing.

Speaker:

We're not talking about that right now, but, um, the

Speaker:

so

Speaker:

ahead.

Speaker:

though, this is what sort of VMware, Plus the backup vendors to figure out

Speaker:

is there something better we can do

Speaker:

Yeah.

Speaker:

order to help satisfy backup needs?

Speaker:

Because it's not going away, right?

Speaker:

You have to

Speaker:

Right,

Speaker:

VMs.

Speaker:

right, right.

Speaker:

So the the real key I think, I think the real goal, I. Like, like if we could

Speaker:

go back in time and I, I don't know.

Speaker:

I don't know.

Speaker:

This is one of the things where like, I don't know if we, I, I wasn't in the

Speaker:

rooms right when they were designing this stuff, but I'd like to think that

Speaker:

somebody said, you know, if, because my memory is, we had issues with tape.

Speaker:

The, the, there was the fundamental mismatch of speed tape.

Speaker:

You know, people thought it was slow, it was actually fast.

Speaker:

That was a problem.

Speaker:

And we need, and so we started doing disc staging.

Speaker:

We started doing disc backups because of that, right?

Speaker:

And then DDU came along and DDU said, Hey, we can make disc better.

Speaker:

We can make disc, you know, more affordable.

Speaker:

And, and so, so that was, and then it just sort of happened.

Speaker:

But even if.

Speaker:

We if, if that wasn't the case, I think the design of backups ultimately ending

Speaker:

up being, uh, what we're talking about today, which is forever incremental,

Speaker:

it, it accomplishes multiple purposes.

Speaker:

One is it, it, it solves the issues that we have with tape.

Speaker:

Because in order for forever incremental work, you've gotta be using disc, right?

Speaker:

Uh, for, for true forever, incremental work to work, you've gotta be using disc.

Speaker:

Um, and then the other thing is that it, it solves the, the cost issues.

Speaker:

It solves the performance issues, restore, you know, it does a lot of things right.

Speaker:

Um,

Speaker:

you first define what forever incremental is?

Speaker:

'cause I don't

Speaker:

I don't want to,

Speaker:

that.

Speaker:

Before, before we get to forever incremental, we gotta talk about

Speaker:

something that was, that was a stop gap technology bef to get from we'd,

Speaker:

we wanted to get to a point where we weren't doing fullsS anymore.

Speaker:

Everybody agreed that full stunk, except especially when we're

Speaker:

talking about virtualization, right?

Speaker:

What was the stop gap technology that backup vendors came out with before

Speaker:

we went to the whole For incremental?

Speaker:

Uh, they created what they called synthetic fulls,

Speaker:

Mm-hmm.

Speaker:

which is basically you are.

Speaker:

Creating a full on the backend, but you're sending data and stitching it together and

Speaker:

piecing it together to make it look like what the full would look like, because you

Speaker:

Yeah.

Speaker:

backups were on that

Speaker:

Right.

Speaker:

So it's like we already know, we, we already know what the files are in

Speaker:

the full, because, you know, we did it on we, we did another incremental.

Speaker:

So we know what's deleted.

Speaker:

We know what's new.

Speaker:

We know what a full would look like.

Speaker:

If we went and made one, why don't we just make one over here

Speaker:

without having to do it over there.

Speaker:

Um, and there were a couple of different ways to do that.

Speaker:

Uh, there was the backup vendor way and there was the, um, the, well, the

Speaker:

backup software vendor way, and then there was the backup hardware vendor way.

Speaker:

Yeah.

Speaker:

I'll talk about the backup software vendor way first.

Speaker:

It's like, we know what all the files should be on a full,

Speaker:

we know where all of them are.

Speaker:

We're just gonna copy them all together.

Speaker:

You know, in one image, and then we're gonna copy that over to a new

Speaker:

set of tapes or a new set of disc

Speaker:

Mm.

Speaker:

so that we essentially create an actual full, um, without actually having to

Speaker:

transfer any data around the network.

Speaker:

It might be transferring it, uh, well it is transferring

Speaker:

it within the backup system.

Speaker:

Right.

Speaker:

And then there was the, the, and, and that took a certain amount of time.

Speaker:

Right.

Speaker:

Um, and then.

Speaker:

A, a vendor that you might be familiar with came out with

Speaker:

a different way to do it.

Speaker:

What was that?

Speaker:

And basically, um, they basically said, just tell us how you wanna

Speaker:

stitch everything together,

Speaker:

Yeah.

Speaker:

it all together.

Speaker:

Yeah.

Speaker:

So it was like, it was like an API, right.

Speaker:

You know where all the stuff is.

Speaker:

We have all the stuff.

Speaker:

Just tell us which stuff you want, where, and we'll go.

Speaker:

Yep.

Speaker:

And so they could make essentially a new full.

Speaker:

When in reality it's just a bunch of pointers to what was already there.

Speaker:

Right.

Speaker:

this

Speaker:

And it was really quick.

Speaker:

Right,

Speaker:

it was really quick because the key is, especially on a de-duplicated system,

Speaker:

you're not actually copying the data.

Speaker:

It's all pointer manipulation.

Speaker:

right, right.

Speaker:

moving any data, you're just changing metadata, which is very quick, very fast.

Speaker:

That was a great singing group from the nineties,

Speaker:

What

Speaker:

I

Speaker:

metadata.

Speaker:

manipulation.

Speaker:

But I'm, anyway, so yeah, so that was sort of a stop gap solution.

Speaker:

It allowed people to stop doing real fulls, solve the issue of the, the,

Speaker:

the load on the client, solve the issue of the load on the network.

Speaker:

But we still, we were still creating fulls.

Speaker:

So the concept of forever incremental.

Speaker:

And, and true forever incremental is that we first have to design a backup product

Speaker:

that just no longer needs repeated fulls.

Speaker:

Right?

Speaker:

And, and by the way, this generally does only apply to, um.

Speaker:

Uh, file system backups for the most part.

Speaker:

Um, not database and application backups.

Speaker:

Most of them still require repeated fullss.

Speaker:

Um, you know, we'll, we'll, some of, some of them have forever incremental, but.

Speaker:

I would also say, or would you also say for virtualized

Speaker:

workloads, this also applies 'cause

Speaker:

Um, oh, you mean, uh, the, the backup of the di of the system itself?

Speaker:

Yes.

Speaker:

Yes.

Speaker:

So, well, so I'll, so if I, if, if I was to use a term that would apply to,

Speaker:

to both of them, I'd say volume-based.

Speaker:

Yeah.

Speaker:

Backups.

Speaker:

Right?

Speaker:

So either a file system or the, or a,

Speaker:

Disc.

Speaker:

file or a VDK file, right?

Speaker:

Um, and the idea is that we're just gonna do one full, and then

Speaker:

we're generally going to be doing block level incremental backups.

Speaker:

Um, not just incremental, but block level incremental.

Speaker:

What's the difference between those?

Speaker:

Well, a block level incremental allows you to look at changes

Speaker:

within, a specific object.

Speaker:

As an example, say you had, uh, 10 gigabyte PST.

Speaker:

File representing your email and instead of doing a normal incremental, which

Speaker:

would transfer the entire 10 gig file over when you back it up using traditional

Speaker:

incrementals, block level incremental would just take the changes in within that

Speaker:

10 gig PST file and transfer that over.

Speaker:

Exactly, and so it, it, it's another data reduction technology like deduplication.

Speaker:

And it, it, it just, it, I I think it's a, it was an order of

Speaker:

magnitude reduction in the number of bytes that has to be transferred.

Speaker:

And, and the way this worked in, in, uh, file system backups was that

Speaker:

you had to have like an API into the actual blocks, the, the discs, right.

Speaker:

Uh, which.

Speaker:

Was somewhat problematic.

Speaker:

The, the, the easier way to get this was in the virtualization

Speaker:

world where you had an API, right, like the change block tracking API

Speaker:

Mm-hmm.

Speaker:

you can just, the backup app can just say to the, the virtualization

Speaker:

vendor, Hey, I'm here to back up again.

Speaker:

Just give me a pointer to all of the blocks that have changed

Speaker:

since the last time I was here.

Speaker:

Yep.

Speaker:

Um, and then we can transfer just those change blocks, but.

Speaker:

But.

Speaker:

importantly, for a true increment, for a true forever incremental, um, to, to, to,

Speaker:

for me to think that it's a truly forever incremental is that you're going to change

Speaker:

how the data is stored on the backend.

Speaker:

So there really isn't a concept of full, right?

Speaker:

Um, the way that you're storing each new backup is that each new backup from a

Speaker:

restore perspective behaves like a full.

Speaker:

Yeah.

Speaker:

Right.

Speaker:

You

Speaker:

Yeah.

Speaker:

want to like put that in?

Speaker:

Go ahead.

Speaker:

so I was just gonna put it in my own words, which

Speaker:

Yeah.

Speaker:

basically you're able to send only the changes and end up with

Speaker:

the full on the backend, such

Speaker:

Right.

Speaker:

each one is a full, they can be independently restored without

Speaker:

any dependencies on others.

Speaker:

Yeah.

Speaker:

So you don't have this idea of restoring the full and restoring

Speaker:

the incrementals individually.

Speaker:

Right.

Speaker:

You just restore the late, you know, it's sort of, it's, it's sort of like take what

Speaker:

we used to do, take the synthetic full.

Speaker:

Every time we do an incremental, we're essentially creating a new synthetic full.

Speaker:

Yep,

Speaker:

How's that?

Speaker:

yep.

Speaker:

Yeah.

Speaker:

Yeah, because if you think about it, it's just kind of changing

Speaker:

where the processing happened,

Speaker:

Right.

Speaker:

Because with Synthetic fulls, the client was telling the storage, okay, these

Speaker:

are the pieces I want you to stitch together, and it might have been pieces

Speaker:

across multiple different backups,

Speaker:

Mm-hmm.

Speaker:

right?

Speaker:

In the case of an incremental forever, it's like, Hey, you know

Speaker:

your last backup go apply these changes on top of that last backup

Speaker:

Yeah.

Speaker:

Yeah.

Speaker:

offsets.

Speaker:

Right.

Speaker:

Um, and go ahead.

Speaker:

it needs to be a system that allows you to keep those copies as independent,

Speaker:

What do you, what do you mean by that?

Speaker:

um, uh, backup technologies which say, which claim to be incremental forever.

Speaker:

Mm-hmm.

Speaker:

They only keep the latest copy,

Speaker:

Right.

Speaker:

for

Speaker:

I.

Speaker:

you have a full, you apply the changes, you now get a new full.

Speaker:

That old full is gone from the system.

Speaker:

Hmm.

Speaker:

Yeah.

Speaker:

have the most recent copy, right?

Speaker:

And normally for backup, because you want to be able to go back further in

Speaker:

time, you need to make sure you are preserving those previous copies as well.

Speaker:

And so it

Speaker:

Yeah.

Speaker:

require a technology like deduplication, which allows you to efficiently store

Speaker:

multiple copies, snapshots, whatever technology is available on the backend

Speaker:

in order to make that possible.

Speaker:

Yeah, so snapshots and replication.

Speaker:

A great example of forever incremental technology, right?

Speaker:

Each new, um.

Speaker:

Each time you do a snapshot and a replication, you get a new, something

Speaker:

that looks like a full, uh, when really all you only transferred was the, the,

Speaker:

you know, the bites that have changed.

Speaker:

And then, um, I can, I can immediately think of when I was,

Speaker:

when you were talking, I was like, what is he talking about?

Speaker:

And then I realized what you were talking about.

Speaker:

So here's an example of forever incremental that isn't what we're

Speaker:

talking about, and that's replication.

Speaker:

It

Speaker:

Yep.

Speaker:

Right.

Speaker:

Re just regular replication without anything

Speaker:

Yep,

Speaker:

beside it.

Speaker:

You end up with each new, every time you do a a, an if, if you want to call

Speaker:

it that, in an incremental backup, all you're left with is the latest

Speaker:

version and you can't go back in time.

Speaker:

You know, you know what I was actually thinking of when I made those comments?

Speaker:

Oh, what were you,

Speaker:

Oracle incremental merge

Speaker:

oh, okay.

Speaker:

because Oracle incremental merge applies to changes on top of a full, but it

Speaker:

does not preserve the previous versions.

Speaker:

Right.

Speaker:

Yeah, that's a good point.

Speaker:

Um, which, which wouldn't it, that's really just a, a type

Speaker:

of, synthetic full, right?

Speaker:

Yeah, yeah, I didn't even realize that, by the way.

Speaker:

Um, so how do you preserve the history?

Speaker:

That's why

Speaker:

Do you just.

Speaker:

do you need a application that, or a storage system that

Speaker:

allows you to take snapshots or

Speaker:

okay.

Speaker:

copy of the file before it

Speaker:

gotcha.

Speaker:

Why do

Speaker:

where was it?

Speaker:

Curtis?

Speaker:

Why do I still remember these things?

Speaker:

It's why you're here?

Speaker:

So we've defined what a

Speaker:

Mm-hmm.

Speaker:

a little bit about the benefits,

Speaker:

Yeah.

Speaker:

Being able to quickly restore because you have the full copy, uh, being independent,

Speaker:

um, not having to transfer all the data.

Speaker:

Right?

Speaker:

Um, can

Speaker:

And also also cost savings.

Speaker:

Right, right.

Speaker:

Yeah.

Speaker:

you think of any cons?

Speaker:

Well, the, the only thing you know, one of the things I said

Speaker:

in the beginning was that.

Speaker:

In order to do forever, incremental, successfully, you've

Speaker:

got to store the data on disc.

Speaker:

And, and again, not, not to be, you know, pro tape and all that stuff,

Speaker:

but we did an episode relatively recently where we talked about how you

Speaker:

have to, uh, protect the backup disc.

Speaker:

The only con that I can think of is that your entire backup is just sitting

Speaker:

there ready to be attacked, right.

Speaker:

Yeah,

Speaker:

Um.

Speaker:

I have, I have a

Speaker:

Go ahead.

Speaker:

You have, you have a con, okay?

Speaker:

So having worked with vendors, and I'm not saying this has happened, but

Speaker:

it is a potential thing you should think about is when you have forever

Speaker:

incrementals, You're always re.

Speaker:

Copying over the existing backup

Speaker:

Mm-hmm.

Speaker:

bug that somehow makes, that corrupts a part of your backup, right?

Speaker:

You've now impacted how many versions back because you're never getting a

Speaker:

full copy of that data again, right?

Speaker:

It's always

Speaker:

Right,

Speaker:

And so

Speaker:

Yeah.

Speaker:

be one of the cons a

Speaker:

Yeah.

Speaker:

approach.

Speaker:

That was, that was a, um, a critique of ddu, uh, because that

Speaker:

has the same issue, a critique of dedupe when Dedupe first came out.

Speaker:

And all you can say is that you just do your best to ensure

Speaker:

that that's not happening.

Speaker:

And that's why we do fingerprinting and, and, you know, integrity

Speaker:

checks and all that kind of stuff.

Speaker:

your backup.

Speaker:

Yes.

Speaker:

And also test your backups.

Speaker:

Right.

Speaker:

Test your backup, test your backup, test your backups.

Speaker:

Um, but go ahead.

Speaker:

I do have another one too.

Speaker:

If you, unless you

Speaker:

Oh, you do?

Speaker:

Okay.

Speaker:

Go ahead.

Speaker:

The other one I also wanted to mention is, right if Forever incremental backups

Speaker:

are so great, why isn't everyone using it?

Speaker:

Mm-hmm.

Speaker:

one of the challenges is it requires vendor support.

Speaker:

Yeah.

Speaker:

application to support it.

Speaker:

So VMware is supported for virtualization.

Speaker:

companies support it for their own applications, but

Speaker:

not everyone supports it.

Speaker:

Not everyone makes it available.

Speaker:

Yeah.

Speaker:

so you sort of are limited in terms of what applications, what

Speaker:

APIs are available, and can you get the changes that you need in

Speaker:

order to be able to create your forever incremental in the backend.

Speaker:

Exactly, and, and know, I can think of modern database applications

Speaker:

that do support it, right?

Speaker:

Um, I can think, for example, uh, when you back up Salesforce, right?

Speaker:

You're backing up at the object level and it's very easy to.

Speaker:

Uh, and, and let me rephrase that, because Salesforce uses

Speaker:

the term object to mean something very different than what we mean.

Speaker:

Um, because why should we all use the same terms?

Speaker:

of course.

Speaker:

A Salesforce object is like the user's table, right?

Speaker:

Um, so I'm talking about like, I made an update to the, to a, you know, a

Speaker:

field in a record in a table, and, um.

Speaker:

Uh, you can back up that change as right, and, and, and you can do it incrementally

Speaker:

or you can do it independently.

Speaker:

And so it would be very easy to store all of those, those

Speaker:

objects in a, in a way that.

Speaker:

It, you know, would, would foster this idea of a forever incremental.

Speaker:

Um, whereas like Oracle for example, there really isn't a way to do it.

Speaker:

The, at best what we can get out of Oracle is the concept of a, um, synthetic full,

Speaker:

uh, and the synthetic full that they have.

Speaker:

Has the issue that you, that you mentioned before, whereas once you

Speaker:

do the synthetic full, you don't have the previous incrementals.

Speaker:

Um, so yeah, it, that is, that is definitely a, um, it, it

Speaker:

requires your backup vendors to support it, and it requires your

Speaker:

application vendors to support it.

Speaker:

Yeah.

Speaker:

Absolutely.

Speaker:

I can't remember where I was going before you brought up that other, um, I, I,

Speaker:

I will say this, that, you know, the title of this is, you know, how Forever

Speaker:

Incremental Backup changed the world.

Speaker:

It, it, it really has, right?

Speaker:

The idea, when you look at these modern backup products, the

Speaker:

idea of doing a, of a, of a full backup is just, it's just gone.

Speaker:

Right?

Speaker:

Um, all of the downsides to doing a regular full backup.

Speaker:

Are, are gone.

Speaker:

Uh, the, the, the, the, the compute considerations, the

Speaker:

network considerations, the storage considerations, they're all gone without

Speaker:

any negative like impact to the restore.

Speaker:

In fact, it's the opposite.

Speaker:

The restore is now, since we know what a full looks like when we go to do the

Speaker:

restore, that is to going back to Harken, back to the original way we did restores.

Speaker:

We're no longer restoring the same file multiple times during a single restore.

Speaker:

Why would we do that?

Speaker:

We know where all the files are, we know where all the blocks are, whatever

Speaker:

it is that we're, we're restoring.

Speaker:

We know what the latest version of that block here file is, and

Speaker:

we're just gonna restore that one.

Speaker:

I have a story.

Speaker:

Um, how you do forever, incremental Matters.

Speaker:

The original, the OG, forever incremental product is TSM.

Speaker:

mm-hmm.

Speaker:

And when, and the way it was implemented back then was to tape.

Speaker:

And they, tried to bring this concept of forever incremental to the world without

Speaker:

changing the underlying storage technology because back then it was, it was, uh.

Speaker:

tape.

Speaker:

It was, uh, if you, if you brought disc into it, it was, it was crazy talk, right?

Speaker:

So the, so the, so, so they can get a lot of credit for trying to

Speaker:

accomplish this, this, um, this idea.

Speaker:

But I, I still think when we look back to TSM of old, I, I think it was, I

Speaker:

think it was a failed attempt because my earlier statement of the only proper way

Speaker:

to do forever incremental is to do it on disc when they were doing it on tape.

Speaker:

What it meant was that when you, um, when you went to go to do a restore, the, the,

Speaker:

the full was on like 5,700 tapes, right?

Speaker:

sounds worse than just doing your traditional fulls and incrementals.

Speaker:

Yeah, well, it was, and I remember, I, I remember doing a, there was a one

Speaker:

particular restore that I remember doing using TSM in a production environment.

Speaker:

And I, I don't remember the actual numbers, but I'll, I'll just use like,

Speaker:

uh, I'll just use fake numbers just to, to, to to, to describe what it was like.

Speaker:

It was like we were doing like a 10 gigabyte restore and it took like.

Speaker:

Two weeks because of how many hundreds of tapes that it was.

Speaker:

Thousands of tapes.

Speaker:

Um, because of the number of files, the number of tapes that they were

Speaker:

on, and how long it takes to load and unload an individual tape.

Speaker:

Right?

Speaker:

Um, and they had technologies, uh, called reclamation and they, they,

Speaker:

they had a number of technologies that tried to mini minimize this issue.

Speaker:

Um, but

Speaker:

It still existed.

Speaker:

well, yeah, it still is existed.

Speaker:

And also some people, they would run out of time in the day to do reclamation.

Speaker:

Reclamation is where you take a bunch of tapes that, um, like they were, that

Speaker:

most of the files on those, on those tapes were no longer needed because

Speaker:

they, they'd expired and then you take, yeah, I've got a bunch of files that are

Speaker:

10% and then you copy them to one tape

Speaker:

It's

Speaker:

to that's now a hundred percent.

Speaker:

It's like garbage collection, but on tape.

Speaker:

And it was a very, um, labor intensive process and, um, IO intensive process.

Speaker:

And as a result, it took a long time.

Speaker:

And so some people would, would turn it off.

Speaker:

Um, and, uh, I think that's what happened in, in, in this case.

Speaker:

And so there, there were literally just, I don't rem there were so many

Speaker:

tapes that we had to restore and it, you know, it sounds like I'm bagging

Speaker:

on T-S-M-T-S-M is still around.

Speaker:

It's now called, uh, spectrum Protect.

Speaker:

Yeah, but the addition of bringing in disc, I think makes this, this whole

Speaker:

issue kind of, uh, not a big deal,

Speaker:

Yeah.

Speaker:

but, um, but yeah, I still think that you look at all of the modern backup

Speaker:

products that have come out in the last 10 plus years, and they're all like this.

Speaker:

They're all one full, followed by a whole bunch of incrementals, and

Speaker:

the world is a better place for it.

Speaker:

And, um, you young whippersnappers have never had to deal with

Speaker:

tape

Speaker:

incremental backups.

Speaker:

You just don't know

Speaker:

what

Speaker:

what you uh, yeah.

Speaker:

You don't know the pain that you, that Yeah.

Speaker:

That you had.

Speaker:

You know, it's funny, this morning I was hanging out with some people and I

Speaker:

was talking to 'em, and they're, it's this lady who shares my birthday, but

Speaker:

she's about 15 years older than me,

Speaker:

Hmm.

Speaker:

we started talking about all of the technology that we

Speaker:

remember when it came out.

Speaker:

Right.

Speaker:

And the first, the first.

Speaker:

Piece of technology I remember being introduced in my life was the, the VCR

Speaker:

Hmm.

Speaker:

and it, it came, they, I remember them wheeling one into the classroom

Speaker:

Hmm.

Speaker:

in my school.

Speaker:

And it was gigantic.

Speaker:

Right.

Speaker:

It was like, and it was the kind where you put the tape in and push it down, right?

Speaker:

Yeah.

Speaker:

Um, anyway, this is, this is like that, this is like the

Speaker:

way, the way we did backups.

Speaker:

And by the way, you know.

Speaker:

I'm, I'm glad I, I'm glad I remembered this because you asked a question earlier.

Speaker:

Why is it that not everybody does things in this new way?

Speaker:

Yeah.

Speaker:

Because backup is the stickiest application in the data center.

Speaker:

Yep.

Speaker:

So many people are still using the same backup application

Speaker:

that they were 15 years ago.

Speaker:

Some people change their applications like they change

Speaker:

their underwear, but other people.

Speaker:

They just, it, it's a pain.

Speaker:

It's costly, it's risk, uh, it's fraught with risk and, and, and

Speaker:

difficulty and a giant learning curve.

Speaker:

And during that learning curve, your, your data's at risk.

Speaker:

And so most people just, they'd rather stick with the ugly, old backup product.

Speaker:

it.

Speaker:

Because it works.

Speaker:

Yeah.

Speaker:

And, and, and so it, it, it gave birth to incrementally helpful

Speaker:

technologies like deduplication.

Speaker:

Don't change anything.

Speaker:

Just send your backups over here instead of sending your backups over there.

Speaker:

And we'll get rid of the duplicates and we'll make it sort of look like a,

Speaker:

an incremental forever, but just keep doing it the dumb old way because we

Speaker:

don't want you to have to change your backup product in order to get to,

Speaker:

and, and I guess this is, you know.

Speaker:

If, if you're one of those people that's still using the backup, the

Speaker:

same backup product that you had 15 years ago and you're still using it

Speaker:

the same way you were using it 15 years ago, and you're not using incremental

Speaker:

forever as a, a true technology in your world, maybe time to think about it.

Speaker:

but I, I would probably say.

Speaker:

Newer workloads, people are probably using that, right?

Speaker:

Because once again, environments are a mishmash of technologies, right?

Speaker:

Yeah.

Speaker:

people probably do have those old databases, old applications,

Speaker:

which they are using an old infrastructure backup infrastructure

Speaker:

for, and for their virtualization or other newer workloads, they're

Speaker:

probably using something different.

Speaker:

Yeah.

Speaker:

Uh, remember I did that, I did that survey a few, uh, months ago and

Speaker:

I asked them how many backup apps they had, and 30% had 30, I think

Speaker:

more than 30% had three or more backup apps in their environment.

Speaker:

Yep.

Speaker:

And it's because of what you're talking about.

Speaker:

They have, they start using.

Speaker:

Containers and their current backup app doesn't know what to do.

Speaker:

And so they use that for the new technology and yeah.

Speaker:

Yeah, that's very common.

Speaker:

And I guess what I'm saying is maybe it's time to check out the water of

Speaker:

the, of, of truly incremental backups and, and experience all the beauty.

Speaker:

Yeah.

Speaker:

All right.

Speaker:

Well thanks for the chat again.

Speaker:

Anytime Curtis.

Speaker:

Anytime.

Speaker:

All right.

Speaker:

And we wanna thank our listeners.

Speaker:

Yo, you're why we're here, you know, otherwise, just two guys sitting here

Speaker:

talking over a couple of microphones.

Speaker:

Uh, that is a wrap.