You found the backup wrap up your go-to podcast for all things
Speaker:backup recovery and cyber recovery.
Speaker:In this episode, we're covering forever incremental backup technology and
Speaker:how that completely revolutionized the entire backup industry.
Speaker:I. Remember the old days of doing weekly fos that brought
Speaker:your systems to their knees?
Speaker:Well, those days are thankfully gone for most people.
Speaker:We're gonna walk through the evolution of, from traditional approaches with
Speaker:fools and incrementals to synthetic fos, and finally to true forever incremental.
Speaker:If you're still doing things the old way, well, you're missing
Speaker:out on some serious benefits.
Speaker:Better performance, lower storage costs, and way faster restores.
Speaker:Perhaps this episode will cause you to rethink things a little bit.
Speaker:By the way, if you don't know who I am, I'm w Curtis Preston, AKA, Mr. Backup,
Speaker:and I've been passionate about backup and recovery for over 30 years, ever since.
Speaker:I had to tell my boss that we had no backups of the production
Speaker:database that we had just lost.
Speaker:I don't want that to happen to you, and that's why I do this.
Speaker:On this podcast, we turn unappreciated backup admins into Cyber Recovery Heroes.
Speaker:This is the backup wrap up.
Speaker:Welcome to the show.
Speaker:Hi, I am w Curtis Preston, AKA, Mr. Backup, and I have with me a guy that
Speaker:I bet is pretty dang happy right now.
Speaker:Prasanna Malaiyandi.
Speaker:Why are you looking like that?
Speaker:very confused about what I'm happy about.
Speaker:Really?
Speaker:Do I need to explain recent events that happened at your house
Speaker:Oh
Speaker:that, that.
Speaker:For those of you by the way, um, who are listening to this podcast via
Speaker:audio, we also provide video on YouTube.
Speaker:So if you wanted to see our expressions just then and really see what Curtis
Speaker:is laughing about, uh, go check us
Speaker:Yeah,
Speaker:Backup wrap up.
Speaker:so what events happened in the last 48 hours?
Speaker:my wife was in India
Speaker:Yeah.
Speaker:Visiting family and she just got back.
Speaker:Yes.
Speaker:it was good.
Speaker:Suddenly, you're so much harder to get on the phone between that and your new job.
Speaker:I don't even know you anymore.
Speaker:He's like, why would I be happy?
Speaker:I, I was really, I was, I thought it was like, I was thinking, I was like,
Speaker:was it something YouTube related?
Speaker:Was it something like I, we had watched or we had talked about and.
Speaker:No, the fact that your wife is back
Speaker:although she's also slightly jet lagged, so,
Speaker:so well, it's what, what, what's it is 12 hours difference.
Speaker:right now it is 13 and a half.
Speaker:Yeah.
Speaker:Yeah.
Speaker:So
Speaker:does
Speaker:of course she's,
Speaker:jet lagged than I do.
Speaker:So
Speaker:yeah.
Speaker:and come back, it takes me like three weeks.
Speaker:And,
Speaker:Yeah.
Speaker:have a dog, which I think some people have seen on the podcast.
Speaker:Um, it's funny because he gets jet lagged when I'm jet lagged.
Speaker:So he, his hours will shift to match with mine.
Speaker:So
Speaker:That's really adorable.
Speaker:and he would be like, Hey, I'm ready.
Speaker:Let's go, let's go.
Speaker:And then when I'm passed out, he's asleep.
Speaker:So he's
Speaker:Interesting.
Speaker:Yes.
Speaker:Your dog that's named after an Indian dessert.
Speaker:cream.
Speaker:Yep.
Speaker:Ice cream.
Speaker:Oh yeah.
Speaker:Ice cream.
Speaker:Yeah.
Speaker:Kulfi, which I have now had, um.
Speaker:your, did you know they have various flavors of Kulfi.
Speaker:They're not all just like one.
Speaker:Oh, I thought it was all mango.
Speaker:So it's met like mango Kulfi and then what?
Speaker:Coconut Kulfi.
Speaker:What are what?
Speaker:What are the other typical
Speaker:and pistachio.
Speaker:Pistachio and, and.
Speaker:I think there are other ones too.
Speaker:Yeah.
Speaker:I don't, yeah, I don't.
Speaker:I've used cardamom as a, an ingredient, you know, when I made the, the
Speaker:mango pie thing, but I don't think I know what it tastes like by itself.
Speaker:So I'll have to, I'll have to sprinkle some and, uh, see what that tastes like.
Speaker:Or maybe just go and
Speaker:Yeah.
Speaker:some
Speaker:next time you're at the Indian store.
Speaker:Actually, sometimes even the Costcos carry like a sampler pack
Speaker:Oh, yeah, so like when I go to the cash and carry down in Mira Mesa.
Speaker:the cash and carry will definitely have it.
Speaker:If you go check out the freezer section.
Speaker:Just
Speaker:Interesting.
Speaker:all before you get home.
Speaker:Yeah.
Speaker:I, oh man.
Speaker:I'm, you're, you're literally like, like my mouth is Yeah, yeah.
Speaker:Thinking about, thinking about it sounds yummy.
Speaker:Hmm.
Speaker:Uh, so we're gonna talk about something that's just as yummy.
Speaker:That is if you're into, into the backup thing, uh, we're gonna talk
Speaker:about forever incremental and sort of.
Speaker:Uh, just how big of a deal that is or was.
Speaker:Um, it, at, at this point?
Speaker:It's in, in most modern backup products that have come out in
Speaker:the last 10 years, I'd say, uh, ones that are actually new, right?
Speaker:So, you know, when we look at the products like Rubrik, Cohesity, or
Speaker:modern SaaS applications like Druva, Hyku, keep it, Alcion, uh, right.
Speaker:You know, these are all of these products.
Speaker:Have forever.
Speaker:Incremental in common, right?
Speaker:Nobody wants to do full backups anymore.
Speaker:Uh, or, or anything like full backups, right?
Speaker:So, um, so let's talk about that.
Speaker:Let's talk about what, you know, where were we, what was it like back in the
Speaker:day, and, um, and then sort of how we got the, from there to here and why it's
Speaker:such a big deal and why you should care.
Speaker:Right.
Speaker:And why this perhaps may give you a reason to finally, um, you
Speaker:know, what, what is it, shuck?
Speaker:The mortal coil.
Speaker:What, what, what, what is that?
Speaker:What,
Speaker:I'm just
Speaker:thinking are there company, and maybe as we're talking we can figure this out, like
Speaker:are there still vendors that exist that don't support forever Incrementals?
Speaker:That's actually a really good question and I'm gonna say, uh, absolutely
Speaker:right, because my concept of what a true forever incremental will be based on.
Speaker:Like a, a complete redesign of the, of the underneath.
Speaker:Right.
Speaker:Um, so the phrase I was looking for is shuttle off sh shuffle off the
Speaker:mortal coil of your old backup product,
Speaker:by the way, wait, quick story.
Speaker:So I'm on, I'm on a plane one time and this, this, um.
Speaker:This pilot, he was talking about the fact that we were about to take off and
Speaker:he referred to taking off as shuffling off the mortal coil of this earth.
Speaker:And I'm like, dude, that's a phrase for dying.
Speaker:What?
Speaker:Why the hell are you, why would you do that?
Speaker:Right.
Speaker:, it's a Shakespeare, it's from the to be or not to be soliloquy
Speaker:I,
Speaker:Hamlet.
Speaker:in all fairness though, I would say 97% of people probably had
Speaker:no idea what that phrase meant.
Speaker:They, they never even heard.
Speaker:Yeah.
Speaker:Yeah.
Speaker:And, and, and, and, and just to, just to, uh, like, you know, since I'm over
Speaker:here quoting Shakespeare, let me just put this, I'm not a huge Shakespeare person.
Speaker:In fact, it, it wasn't until, I dunno, several years ago, and we were in England
Speaker:where I, I, I got to actually see my first, um, uh, Shakespeare play, which
Speaker:was Henry V. Some people say Henry the VI don't, I think it's actually
Speaker:called Henry five, which is the, um.
Speaker:The one with the St. Crispin stay speech.
Speaker:Uh, once more into the breach, close the wall up with, with
Speaker:her English dead and all that.
Speaker:But anyway, sorry, I just bored everybody.
Speaker:Sorry.
Speaker:Sorry, you're asleep.
Speaker:Wake back up.
Speaker:Wake back up.
Speaker:We're, we're gonna talk about backup, which is so much more
Speaker:exciting than Shakespeare.
Speaker:Um, so, all right, so first let's talk about, do you want to define, uh, so what?
Speaker:let's talk about the old days first.
Speaker:that's what I, that's what I'm saying.
Speaker:That's what I'm saying.
Speaker:Let's, do you wanna define, uh, what a full and, and
Speaker:Yeah.
Speaker:an incremental backup is?
Speaker:Yeah.
Speaker:So.
Speaker:Normally when you think about backups, right, you talk,
Speaker:typically people talk about fulls.
Speaker:Incrementals, right?
Speaker:A full is basically your entire application, dataset, whatever
Speaker:it is, everything is backed up.
Speaker:Mm-hmm.
Speaker:then between doing every fulls, you would do incrementals, which are just the things
Speaker:that have changed since that last full.
Speaker:Right, right.
Speaker:you do these periodically, and normally you do incrementals
Speaker:because they will be smaller.
Speaker:So you could say do a full once a week and then incrementals during the week
Speaker:such that you can finish in time or during your backup window, and then the next
Speaker:weekend you would do like another full,
Speaker:Right, right now, slightly, I'll slightly change your
Speaker:definition or a pen or whatever.
Speaker:Um, so you specifically described what I would call a
Speaker:cumulative incremental, right?
Speaker:So a, uh, what is referred to as a sort of a regular, incremental
Speaker:or a differential incremental.
Speaker:Yep,
Speaker:it is one where it's the changes just since the last incremental or since
Speaker:the last backup of any kind, right?
Speaker:And then you
Speaker:Um,
Speaker:levels and all the
Speaker:yeah.
Speaker:Yeah, yeah,
Speaker:But yeah, at a high level, yeah, there are those three types, if you will.
Speaker:yeah.
Speaker:And, and, and then what really, um, what really what it really
Speaker:came down to is how things behaved when it came time for a restore.
Speaker:Right.
Speaker:So if you were going to restore something, then you needed the full,
Speaker:the latest cumulative, incremental, if you were doing such a thing, and
Speaker:then any incrementals since that day.
Speaker:And so, um, those of us, uh.
Speaker:I, I remember at some point we, we used to do weekly fulls.
Speaker:I remember my preferred backup design switching to monthly fulls,
Speaker:weekly cumulative incrementals, and then daily incrementals.
Speaker:So if you were, let's say three weeks into the month, you would need the full
Speaker:from the first of the month, the latest, uh, weekly cumulative, incremental,
Speaker:and then three or four days worth of.
Speaker:Incrementals
Speaker:Yeah,
Speaker:and, uh, in order to do a restore.
Speaker:. So what it meant was that you would often, the way the Restore worked is you
Speaker:would restore everything that was in the full, then you would restore everything
Speaker:that was in the cumulative incremental, and then you would restore everything
Speaker:that was in each of the incrementals.
Speaker:What that meant was there were many files that you were actually restoring
Speaker:multiple times and you were overwriting them, which was just inefficient.
Speaker:Um, and what that also meant was.
Speaker:That you had, uh, uh, duplicate data, the old way of doing backups created
Speaker:a ton of duplicate data on the tapes.
Speaker:You could say duplicate, or you could also say it's independent,
Speaker:depending on the glass.
Speaker:Half full glass, half empty.
Speaker:Well, so no, I, I understand what you're saying.
Speaker:What I'm talking about is the fulls,
Speaker:Yeah,
Speaker:the fulls over time created a ton of duplicate data.
Speaker:that's true.
Speaker:Right.
Speaker:I know what you're thinking.
Speaker:You're thinking, you're talking about the, within the, within the, the incrementals.
Speaker:I'm mainly talking about
Speaker:yeah,
Speaker:how many different copies of the exact same file were all
Speaker:over, you know, everywhere.
Speaker:and especially because most times, right, you look at sort of like a
Speaker:2% daily change rate, which maybe
Speaker:Yeah.
Speaker:a seven or 8% change rate per week.
Speaker:And
Speaker:Yeah.
Speaker:those weekly cumulatives or even the monthly fulls, right, you're
Speaker:basically backing up a bunch of data, which you don't necessarily
Speaker:need that haven't changed.
Speaker:Right, right.
Speaker:You, you, the reason we did it the way we did, it wasn't so much about independence.
Speaker:It was about.
Speaker:Minimize.
Speaker:So if you did a monthly full, followed by nothing but regular
Speaker:incrementals, you would have to load 30 tapes at the end of the day, right?
Speaker:At the end of, and even if it wasn't tapes, you would have
Speaker:to, you have to do 30 restores.
Speaker:Yeah,
Speaker:So, so the, the, the, the differential or the cumulative incremental, um,
Speaker:was about minimizing the level of effort during a restore, right?
Speaker:And then came along this thing called dedupe.
Speaker:Uh, which at least helped minimize, I think the, all of that duplicate data.
Speaker:because I think with deduplication it was nice because you didn't
Speaker:have to change what you were doing from like a policy perspective
Speaker:Right,
Speaker:perspective.
Speaker:But you got, like you said, it was able to.
Speaker:Dedupe, of course,
Speaker:right.
Speaker:in the backend and save you the storage costs.
Speaker:And that's why when, if you, if you were doing a, a typical weekly, by
Speaker:the way, nobody did the, the monthly full thing that I recommended, most
Speaker:people did weekly fulls, right?
Speaker:And so when you had weekly fulls, um.
Speaker:This is why, and you, and you had like 90 day retention.
Speaker:This is why just with even crappy dedupe, you got 20 to one,
Speaker:Yeah.
Speaker:uh, data reduction, right?
Speaker:And this is what helped, um, pay for, you know, the what to help
Speaker:created the, the market for all that, all that dedupe work, right?
Speaker:And then we could argue over, um, you know, well we're, we're
Speaker:20 to one, we're 30 to one, we're, you know, whatever, right?
Speaker:And, um, we, we saved this much more money.
Speaker:Um.
Speaker:You know, once you get rid of all those fulls, everything else
Speaker:was, was really kind of incre
Speaker:Yeah.
Speaker:incremental.
Speaker:Um, but the real, I think the real challenge was just the core backup design
Speaker:of when we go to do a restore, we have to restore, uh, data multiple times.
Speaker:Right.
Speaker:It just simply wasn't efficient.
Speaker:Well, and I think even before we get there, I'd like to cover
Speaker:Mm-hmm.
Speaker:uh,
Speaker:Okay.
Speaker:if you had the existing backup schedule, right, where I'm still doing
Speaker:weekly fulls and daily incrementals,
Speaker:Mm-hmm.
Speaker:still transferring all that data over the network.
Speaker:Right?
Speaker:And I think this is where client side deduplication would probably
Speaker:come next, in my opinion, right?
Speaker:Yep.
Speaker:it would help you reduce the amount of data you're actually
Speaker:sending over the network.
Speaker:Right.
Speaker:And that'll help save you cost as well as time.
Speaker:Yeah, that's actually a really good point.
Speaker:Thanks for bringing that up.
Speaker:So the, one of the other like, like basically what you're saying that one
Speaker:of the other, um, downsides to doing it the old way was both the transfer
Speaker:across the network as well as the impact, uh, to the clients, right?
Speaker:If you're doing a full, on a regular basis, um, you're, um.
Speaker:There's a, there's a cost to that
Speaker:Yep.
Speaker:in both, you know, you brought up bandwidth, it's
Speaker:bandwidth storage and compute.
Speaker:Right?
Speaker:Um, it's like all the things.
Speaker:There was also a, um, sort of a, like a perfect storm brewing.
Speaker:Yeah.
Speaker:So we're doing fulls on a regular basis.
Speaker:We're doing cumulative incrementals on a regular basis, and this is
Speaker:how we were doing things, and then along came virtualization.
Speaker:Why was that a problem?
Speaker:Because a lot of the data was
Speaker:I,
Speaker:the same, right?
Speaker:When you look across all of your VMs, they're probably spun up
Speaker:from the same golden copy, of your
Speaker:Well, the,
Speaker:and
Speaker:yeah, well, the.
Speaker:amongst them.
Speaker:Well, that, that's a, this is another example of, that's a great answer.
Speaker:It's not the question I was asking.
Speaker:It's
Speaker:Um,
Speaker:resource on the backend because
Speaker:that's what I was going after.
Speaker:Yeah,
Speaker:and the compute, right?
Speaker:You don't have your
Speaker:yeah,
Speaker:instance
Speaker:yeah.
Speaker:That's, that's what I was going after.
Speaker:Right?
Speaker:So, so that was a real problem when VMware got really popular and we just
Speaker:kept doing the same old thing because.
Speaker:Remember, for a long time we backed up VMware.
Speaker:We backed up the physical, the virtual machines, just like
Speaker:they were physical machines.
Speaker:Yes.
Speaker:Guest ba. We just basically installed an A, an agent and every guest,
Speaker:and that's how we did the backups.
Speaker:And, you know, and what happened was you'd do just randomly, you'd end up
Speaker:doing a bunch of, at the same time.
Speaker:Which worked fine when we were 20 individual machines, but now we're 20
Speaker:VMs all on the same physical machine, and that just broke, like virtualization
Speaker:just broke back up overnight.
Speaker:Right.
Speaker:Um, the latest thing to break back up overnight is, um, containers.
Speaker:Yep.
Speaker:That's a whole other thing.
Speaker:We're not talking about that right now, but, um, the
Speaker:so
Speaker:ahead.
Speaker:though, this is what sort of VMware, Plus the backup vendors to figure out
Speaker:is there something better we can do
Speaker:Yeah.
Speaker:order to help satisfy backup needs?
Speaker:Because it's not going away, right?
Speaker:You have to
Speaker:Right,
Speaker:VMs.
Speaker:right, right.
Speaker:So the the real key I think, I think the real goal, I. Like, like if we could
Speaker:go back in time and I, I don't know.
Speaker:I don't know.
Speaker:This is one of the things where like, I don't know if we, I, I wasn't in the
Speaker:rooms right when they were designing this stuff, but I'd like to think that
Speaker:somebody said, you know, if, because my memory is, we had issues with tape.
Speaker:The, the, there was the fundamental mismatch of speed tape.
Speaker:You know, people thought it was slow, it was actually fast.
Speaker:That was a problem.
Speaker:And we need, and so we started doing disc staging.
Speaker:We started doing disc backups because of that, right?
Speaker:And then DDU came along and DDU said, Hey, we can make disc better.
Speaker:We can make disc, you know, more affordable.
Speaker:And, and so, so that was, and then it just sort of happened.
Speaker:But even if.
Speaker:We if, if that wasn't the case, I think the design of backups ultimately ending
Speaker:up being, uh, what we're talking about today, which is forever incremental,
Speaker:it, it accomplishes multiple purposes.
Speaker:One is it, it, it solves the issues that we have with tape.
Speaker:Because in order for forever incremental work, you've gotta be using disc, right?
Speaker:Uh, for, for true forever, incremental work to work, you've gotta be using disc.
Speaker:Um, and then the other thing is that it, it solves the, the cost issues.
Speaker:It solves the performance issues, restore, you know, it does a lot of things right.
Speaker:Um,
Speaker:you first define what forever incremental is?
Speaker:'cause I don't
Speaker:I don't want to,
Speaker:that.
Speaker:Before, before we get to forever incremental, we gotta talk about
Speaker:something that was, that was a stop gap technology bef to get from we'd,
Speaker:we wanted to get to a point where we weren't doing fullsS anymore.
Speaker:Everybody agreed that full stunk, except especially when we're
Speaker:talking about virtualization, right?
Speaker:What was the stop gap technology that backup vendors came out with before
Speaker:we went to the whole For incremental?
Speaker:Uh, they created what they called synthetic fulls,
Speaker:Mm-hmm.
Speaker:which is basically you are.
Speaker:Creating a full on the backend, but you're sending data and stitching it together and
Speaker:piecing it together to make it look like what the full would look like, because you
Speaker:Yeah.
Speaker:backups were on that
Speaker:Right.
Speaker:So it's like we already know, we, we already know what the files are in
Speaker:the full, because, you know, we did it on we, we did another incremental.
Speaker:So we know what's deleted.
Speaker:We know what's new.
Speaker:We know what a full would look like.
Speaker:If we went and made one, why don't we just make one over here
Speaker:without having to do it over there.
Speaker:Um, and there were a couple of different ways to do that.
Speaker:Uh, there was the backup vendor way and there was the, um, the, well, the
Speaker:backup software vendor way, and then there was the backup hardware vendor way.
Speaker:Yeah.
Speaker:I'll talk about the backup software vendor way first.
Speaker:It's like, we know what all the files should be on a full,
Speaker:we know where all of them are.
Speaker:We're just gonna copy them all together.
Speaker:You know, in one image, and then we're gonna copy that over to a new
Speaker:set of tapes or a new set of disc
Speaker:Mm.
Speaker:so that we essentially create an actual full, um, without actually having to
Speaker:transfer any data around the network.
Speaker:It might be transferring it, uh, well it is transferring
Speaker:it within the backup system.
Speaker:Right.
Speaker:And then there was the, the, and, and that took a certain amount of time.
Speaker:Right.
Speaker:Um, and then.
Speaker:A, a vendor that you might be familiar with came out with
Speaker:a different way to do it.
Speaker:What was that?
Speaker:And basically, um, they basically said, just tell us how you wanna
Speaker:stitch everything together,
Speaker:Yeah.
Speaker:it all together.
Speaker:Yeah.
Speaker:So it was like, it was like an API, right.
Speaker:You know where all the stuff is.
Speaker:We have all the stuff.
Speaker:Just tell us which stuff you want, where, and we'll go.
Speaker:Yep.
Speaker:And so they could make essentially a new full.
Speaker:When in reality it's just a bunch of pointers to what was already there.
Speaker:Right.
Speaker:this
Speaker:And it was really quick.
Speaker:Right,
Speaker:it was really quick because the key is, especially on a de-duplicated system,
Speaker:you're not actually copying the data.
Speaker:It's all pointer manipulation.
Speaker:right, right.
Speaker:moving any data, you're just changing metadata, which is very quick, very fast.
Speaker:That was a great singing group from the nineties,
Speaker:What
Speaker:I
Speaker:metadata.
Speaker:manipulation.
Speaker:But I'm, anyway, so yeah, so that was sort of a stop gap solution.
Speaker:It allowed people to stop doing real fulls, solve the issue of the, the,
Speaker:the load on the client, solve the issue of the load on the network.
Speaker:But we still, we were still creating fulls.
Speaker:So the concept of forever incremental.
Speaker:And, and true forever incremental is that we first have to design a backup product
Speaker:that just no longer needs repeated fulls.
Speaker:Right?
Speaker:And, and by the way, this generally does only apply to, um.
Speaker:Uh, file system backups for the most part.
Speaker:Um, not database and application backups.
Speaker:Most of them still require repeated fullss.
Speaker:Um, you know, we'll, we'll, some of, some of them have forever incremental, but.
Speaker:I would also say, or would you also say for virtualized
Speaker:workloads, this also applies 'cause
Speaker:Um, oh, you mean, uh, the, the backup of the di of the system itself?
Speaker:Yes.
Speaker:Yes.
Speaker:So, well, so I'll, so if I, if, if I was to use a term that would apply to,
Speaker:to both of them, I'd say volume-based.
Speaker:Yeah.
Speaker:Backups.
Speaker:Right?
Speaker:So either a file system or the, or a,
Speaker:Disc.
Speaker:file or a VDK file, right?
Speaker:Um, and the idea is that we're just gonna do one full, and then
Speaker:we're generally going to be doing block level incremental backups.
Speaker:Um, not just incremental, but block level incremental.
Speaker:What's the difference between those?
Speaker:Well, a block level incremental allows you to look at changes
Speaker:within, a specific object.
Speaker:As an example, say you had, uh, 10 gigabyte PST.
Speaker:File representing your email and instead of doing a normal incremental, which
Speaker:would transfer the entire 10 gig file over when you back it up using traditional
Speaker:incrementals, block level incremental would just take the changes in within that
Speaker:10 gig PST file and transfer that over.
Speaker:Exactly, and so it, it, it's another data reduction technology like deduplication.
Speaker:And it, it, it just, it, I I think it's a, it was an order of
Speaker:magnitude reduction in the number of bytes that has to be transferred.
Speaker:And, and the way this worked in, in, uh, file system backups was that
Speaker:you had to have like an API into the actual blocks, the, the discs, right.
Speaker:Uh, which.
Speaker:Was somewhat problematic.
Speaker:The, the, the easier way to get this was in the virtualization
Speaker:world where you had an API, right, like the change block tracking API
Speaker:Mm-hmm.
Speaker:you can just, the backup app can just say to the, the virtualization
Speaker:vendor, Hey, I'm here to back up again.
Speaker:Just give me a pointer to all of the blocks that have changed
Speaker:since the last time I was here.
Speaker:Yep.
Speaker:Um, and then we can transfer just those change blocks, but.
Speaker:But.
Speaker:importantly, for a true increment, for a true forever incremental, um, to, to, to,
Speaker:for me to think that it's a truly forever incremental is that you're going to change
Speaker:how the data is stored on the backend.
Speaker:So there really isn't a concept of full, right?
Speaker:Um, the way that you're storing each new backup is that each new backup from a
Speaker:restore perspective behaves like a full.
Speaker:Yeah.
Speaker:Right.
Speaker:You
Speaker:Yeah.
Speaker:want to like put that in?
Speaker:Go ahead.
Speaker:so I was just gonna put it in my own words, which
Speaker:Yeah.
Speaker:basically you're able to send only the changes and end up with
Speaker:the full on the backend, such
Speaker:Right.
Speaker:each one is a full, they can be independently restored without
Speaker:any dependencies on others.
Speaker:Yeah.
Speaker:So you don't have this idea of restoring the full and restoring
Speaker:the incrementals individually.
Speaker:Right.
Speaker:You just restore the late, you know, it's sort of, it's, it's sort of like take what
Speaker:we used to do, take the synthetic full.
Speaker:Every time we do an incremental, we're essentially creating a new synthetic full.
Speaker:Yep,
Speaker:How's that?
Speaker:yep.
Speaker:Yeah.
Speaker:Yeah, because if you think about it, it's just kind of changing
Speaker:where the processing happened,
Speaker:Right.
Speaker:Because with Synthetic fulls, the client was telling the storage, okay, these
Speaker:are the pieces I want you to stitch together, and it might have been pieces
Speaker:across multiple different backups,
Speaker:Mm-hmm.
Speaker:right?
Speaker:In the case of an incremental forever, it's like, Hey, you know
Speaker:your last backup go apply these changes on top of that last backup
Speaker:Yeah.
Speaker:Yeah.
Speaker:offsets.
Speaker:Right.
Speaker:Um, and go ahead.
Speaker:it needs to be a system that allows you to keep those copies as independent,
Speaker:What do you, what do you mean by that?
Speaker:um, uh, backup technologies which say, which claim to be incremental forever.
Speaker:Mm-hmm.
Speaker:They only keep the latest copy,
Speaker:Right.
Speaker:for
Speaker:I.
Speaker:you have a full, you apply the changes, you now get a new full.
Speaker:That old full is gone from the system.
Speaker:Hmm.
Speaker:Yeah.
Speaker:have the most recent copy, right?
Speaker:And normally for backup, because you want to be able to go back further in
Speaker:time, you need to make sure you are preserving those previous copies as well.
Speaker:And so it
Speaker:Yeah.
Speaker:require a technology like deduplication, which allows you to efficiently store
Speaker:multiple copies, snapshots, whatever technology is available on the backend
Speaker:in order to make that possible.
Speaker:Yeah, so snapshots and replication.
Speaker:A great example of forever incremental technology, right?
Speaker:Each new, um.
Speaker:Each time you do a snapshot and a replication, you get a new, something
Speaker:that looks like a full, uh, when really all you only transferred was the, the,
Speaker:you know, the bites that have changed.
Speaker:And then, um, I can, I can immediately think of when I was,
Speaker:when you were talking, I was like, what is he talking about?
Speaker:And then I realized what you were talking about.
Speaker:So here's an example of forever incremental that isn't what we're
Speaker:talking about, and that's replication.
Speaker:It
Speaker:Yep.
Speaker:Right.
Speaker:Re just regular replication without anything
Speaker:Yep,
Speaker:beside it.
Speaker:You end up with each new, every time you do a a, an if, if you want to call
Speaker:it that, in an incremental backup, all you're left with is the latest
Speaker:version and you can't go back in time.
Speaker:You know, you know what I was actually thinking of when I made those comments?
Speaker:Oh, what were you,
Speaker:Oracle incremental merge
Speaker:oh, okay.
Speaker:because Oracle incremental merge applies to changes on top of a full, but it
Speaker:does not preserve the previous versions.
Speaker:Right.
Speaker:Yeah, that's a good point.
Speaker:Um, which, which wouldn't it, that's really just a, a type
Speaker:of, synthetic full, right?
Speaker:Yeah, yeah, I didn't even realize that, by the way.
Speaker:Um, so how do you preserve the history?
Speaker:That's why
Speaker:Do you just.
Speaker:do you need a application that, or a storage system that
Speaker:allows you to take snapshots or
Speaker:okay.
Speaker:copy of the file before it
Speaker:gotcha.
Speaker:Why do
Speaker:where was it?
Speaker:Curtis?
Speaker:Why do I still remember these things?
Speaker:It's why you're here?
Speaker:So we've defined what a
Speaker:Mm-hmm.
Speaker:a little bit about the benefits,
Speaker:Yeah.
Speaker:Being able to quickly restore because you have the full copy, uh, being independent,
Speaker:um, not having to transfer all the data.
Speaker:Right?
Speaker:Um, can
Speaker:And also also cost savings.
Speaker:Right, right.
Speaker:Yeah.
Speaker:you think of any cons?
Speaker:Well, the, the only thing you know, one of the things I said
Speaker:in the beginning was that.
Speaker:In order to do forever, incremental, successfully, you've
Speaker:got to store the data on disc.
Speaker:And, and again, not, not to be, you know, pro tape and all that stuff,
Speaker:but we did an episode relatively recently where we talked about how you
Speaker:have to, uh, protect the backup disc.
Speaker:The only con that I can think of is that your entire backup is just sitting
Speaker:there ready to be attacked, right.
Speaker:Yeah,
Speaker:Um.
Speaker:I have, I have a
Speaker:Go ahead.
Speaker:You have, you have a con, okay?
Speaker:So having worked with vendors, and I'm not saying this has happened, but
Speaker:it is a potential thing you should think about is when you have forever
Speaker:incrementals, You're always re.
Speaker:Copying over the existing backup
Speaker:Mm-hmm.
Speaker:bug that somehow makes, that corrupts a part of your backup, right?
Speaker:You've now impacted how many versions back because you're never getting a
Speaker:full copy of that data again, right?
Speaker:It's always
Speaker:Right,
Speaker:And so
Speaker:Yeah.
Speaker:be one of the cons a
Speaker:Yeah.
Speaker:approach.
Speaker:That was, that was a, um, a critique of ddu, uh, because that
Speaker:has the same issue, a critique of dedupe when Dedupe first came out.
Speaker:And all you can say is that you just do your best to ensure
Speaker:that that's not happening.
Speaker:And that's why we do fingerprinting and, and, you know, integrity
Speaker:checks and all that kind of stuff.
Speaker:your backup.
Speaker:Yes.
Speaker:And also test your backups.
Speaker:Right.
Speaker:Test your backup, test your backup, test your backups.
Speaker:Um, but go ahead.
Speaker:I do have another one too.
Speaker:If you, unless you
Speaker:Oh, you do?
Speaker:Okay.
Speaker:Go ahead.
Speaker:The other one I also wanted to mention is, right if Forever incremental backups
Speaker:are so great, why isn't everyone using it?
Speaker:Mm-hmm.
Speaker:one of the challenges is it requires vendor support.
Speaker:Yeah.
Speaker:application to support it.
Speaker:So VMware is supported for virtualization.
Speaker:companies support it for their own applications, but
Speaker:not everyone supports it.
Speaker:Not everyone makes it available.
Speaker:Yeah.
Speaker:so you sort of are limited in terms of what applications, what
Speaker:APIs are available, and can you get the changes that you need in
Speaker:order to be able to create your forever incremental in the backend.
Speaker:Exactly, and, and know, I can think of modern database applications
Speaker:that do support it, right?
Speaker:Um, I can think, for example, uh, when you back up Salesforce, right?
Speaker:You're backing up at the object level and it's very easy to.
Speaker:Uh, and, and let me rephrase that, because Salesforce uses
Speaker:the term object to mean something very different than what we mean.
Speaker:Um, because why should we all use the same terms?
Speaker:of course.
Speaker:A Salesforce object is like the user's table, right?
Speaker:Um, so I'm talking about like, I made an update to the, to a, you know, a
Speaker:field in a record in a table, and, um.
Speaker:Uh, you can back up that change as right, and, and, and you can do it incrementally
Speaker:or you can do it independently.
Speaker:And so it would be very easy to store all of those, those
Speaker:objects in a, in a way that.
Speaker:It, you know, would, would foster this idea of a forever incremental.
Speaker:Um, whereas like Oracle for example, there really isn't a way to do it.
Speaker:The, at best what we can get out of Oracle is the concept of a, um, synthetic full,
Speaker:uh, and the synthetic full that they have.
Speaker:Has the issue that you, that you mentioned before, whereas once you
Speaker:do the synthetic full, you don't have the previous incrementals.
Speaker:Um, so yeah, it, that is, that is definitely a, um, it, it
Speaker:requires your backup vendors to support it, and it requires your
Speaker:application vendors to support it.
Speaker:Yeah.
Speaker:Absolutely.
Speaker:I can't remember where I was going before you brought up that other, um, I, I,
Speaker:I will say this, that, you know, the title of this is, you know, how Forever
Speaker:Incremental Backup changed the world.
Speaker:It, it, it really has, right?
Speaker:The idea, when you look at these modern backup products, the
Speaker:idea of doing a, of a, of a full backup is just, it's just gone.
Speaker:Right?
Speaker:Um, all of the downsides to doing a regular full backup.
Speaker:Are, are gone.
Speaker:Uh, the, the, the, the, the compute considerations, the
Speaker:network considerations, the storage considerations, they're all gone without
Speaker:any negative like impact to the restore.
Speaker:In fact, it's the opposite.
Speaker:The restore is now, since we know what a full looks like when we go to do the
Speaker:restore, that is to going back to Harken, back to the original way we did restores.
Speaker:We're no longer restoring the same file multiple times during a single restore.
Speaker:Why would we do that?
Speaker:We know where all the files are, we know where all the blocks are, whatever
Speaker:it is that we're, we're restoring.
Speaker:We know what the latest version of that block here file is, and
Speaker:we're just gonna restore that one.
Speaker:I have a story.
Speaker:Um, how you do forever, incremental Matters.
Speaker:The original, the OG, forever incremental product is TSM.
Speaker:mm-hmm.
Speaker:And when, and the way it was implemented back then was to tape.
Speaker:And they, tried to bring this concept of forever incremental to the world without
Speaker:changing the underlying storage technology because back then it was, it was, uh.
Speaker:tape.
Speaker:It was, uh, if you, if you brought disc into it, it was, it was crazy talk, right?
Speaker:So the, so the, so, so they can get a lot of credit for trying to
Speaker:accomplish this, this, um, this idea.
Speaker:But I, I still think when we look back to TSM of old, I, I think it was, I
Speaker:think it was a failed attempt because my earlier statement of the only proper way
Speaker:to do forever incremental is to do it on disc when they were doing it on tape.
Speaker:What it meant was that when you, um, when you went to go to do a restore, the, the,
Speaker:the full was on like 5,700 tapes, right?
Speaker:sounds worse than just doing your traditional fulls and incrementals.
Speaker:Yeah, well, it was, and I remember, I, I remember doing a, there was a one
Speaker:particular restore that I remember doing using TSM in a production environment.
Speaker:And I, I don't remember the actual numbers, but I'll, I'll just use like,
Speaker:uh, I'll just use fake numbers just to, to, to to, to describe what it was like.
Speaker:It was like we were doing like a 10 gigabyte restore and it took like.
Speaker:Two weeks because of how many hundreds of tapes that it was.
Speaker:Thousands of tapes.
Speaker:Um, because of the number of files, the number of tapes that they were
Speaker:on, and how long it takes to load and unload an individual tape.
Speaker:Right?
Speaker:Um, and they had technologies, uh, called reclamation and they, they,
Speaker:they had a number of technologies that tried to mini minimize this issue.
Speaker:Um, but
Speaker:It still existed.
Speaker:well, yeah, it still is existed.
Speaker:And also some people, they would run out of time in the day to do reclamation.
Speaker:Reclamation is where you take a bunch of tapes that, um, like they were, that
Speaker:most of the files on those, on those tapes were no longer needed because
Speaker:they, they'd expired and then you take, yeah, I've got a bunch of files that are
Speaker:10% and then you copy them to one tape
Speaker:It's
Speaker:to that's now a hundred percent.
Speaker:It's like garbage collection, but on tape.
Speaker:And it was a very, um, labor intensive process and, um, IO intensive process.
Speaker:And as a result, it took a long time.
Speaker:And so some people would, would turn it off.
Speaker:Um, and, uh, I think that's what happened in, in, in this case.
Speaker:And so there, there were literally just, I don't rem there were so many
Speaker:tapes that we had to restore and it, you know, it sounds like I'm bagging
Speaker:on T-S-M-T-S-M is still around.
Speaker:It's now called, uh, spectrum Protect.
Speaker:Yeah, but the addition of bringing in disc, I think makes this, this whole
Speaker:issue kind of, uh, not a big deal,
Speaker:Yeah.
Speaker:but, um, but yeah, I still think that you look at all of the modern backup
Speaker:products that have come out in the last 10 plus years, and they're all like this.
Speaker:They're all one full, followed by a whole bunch of incrementals, and
Speaker:the world is a better place for it.
Speaker:And, um, you young whippersnappers have never had to deal with
Speaker:tape
Speaker:incremental backups.
Speaker:You just don't know
Speaker:what
Speaker:what you uh, yeah.
Speaker:You don't know the pain that you, that Yeah.
Speaker:That you had.
Speaker:You know, it's funny, this morning I was hanging out with some people and I
Speaker:was talking to 'em, and they're, it's this lady who shares my birthday, but
Speaker:she's about 15 years older than me,
Speaker:Hmm.
Speaker:we started talking about all of the technology that we
Speaker:remember when it came out.
Speaker:Right.
Speaker:And the first, the first.
Speaker:Piece of technology I remember being introduced in my life was the, the VCR
Speaker:Hmm.
Speaker:and it, it came, they, I remember them wheeling one into the classroom
Speaker:Hmm.
Speaker:in my school.
Speaker:And it was gigantic.
Speaker:Right.
Speaker:It was like, and it was the kind where you put the tape in and push it down, right?
Speaker:Yeah.
Speaker:Um, anyway, this is, this is like that, this is like the
Speaker:way, the way we did backups.
Speaker:And by the way, you know.
Speaker:I'm, I'm glad I, I'm glad I remembered this because you asked a question earlier.
Speaker:Why is it that not everybody does things in this new way?
Speaker:Yeah.
Speaker:Because backup is the stickiest application in the data center.
Speaker:Yep.
Speaker:So many people are still using the same backup application
Speaker:that they were 15 years ago.
Speaker:Some people change their applications like they change
Speaker:their underwear, but other people.
Speaker:They just, it, it's a pain.
Speaker:It's costly, it's risk, uh, it's fraught with risk and, and, and
Speaker:difficulty and a giant learning curve.
Speaker:And during that learning curve, your, your data's at risk.
Speaker:And so most people just, they'd rather stick with the ugly, old backup product.
Speaker:it.
Speaker:Because it works.
Speaker:Yeah.
Speaker:And, and, and so it, it, it gave birth to incrementally helpful
Speaker:technologies like deduplication.
Speaker:Don't change anything.
Speaker:Just send your backups over here instead of sending your backups over there.
Speaker:And we'll get rid of the duplicates and we'll make it sort of look like a,
Speaker:an incremental forever, but just keep doing it the dumb old way because we
Speaker:don't want you to have to change your backup product in order to get to,
Speaker:and, and I guess this is, you know.
Speaker:If, if you're one of those people that's still using the backup, the
Speaker:same backup product that you had 15 years ago and you're still using it
Speaker:the same way you were using it 15 years ago, and you're not using incremental
Speaker:forever as a, a true technology in your world, maybe time to think about it.
Speaker:but I, I would probably say.
Speaker:Newer workloads, people are probably using that, right?
Speaker:Because once again, environments are a mishmash of technologies, right?
Speaker:Yeah.
Speaker:people probably do have those old databases, old applications,
Speaker:which they are using an old infrastructure backup infrastructure
Speaker:for, and for their virtualization or other newer workloads, they're
Speaker:probably using something different.
Speaker:Yeah.
Speaker:Uh, remember I did that, I did that survey a few, uh, months ago and
Speaker:I asked them how many backup apps they had, and 30% had 30, I think
Speaker:more than 30% had three or more backup apps in their environment.
Speaker:Yep.
Speaker:And it's because of what you're talking about.
Speaker:They have, they start using.
Speaker:Containers and their current backup app doesn't know what to do.
Speaker:And so they use that for the new technology and yeah.
Speaker:Yeah, that's very common.
Speaker:And I guess what I'm saying is maybe it's time to check out the water of
Speaker:the, of, of truly incremental backups and, and experience all the beauty.
Speaker:Yeah.
Speaker:All right.
Speaker:Well thanks for the chat again.
Speaker:Anytime Curtis.
Speaker:Anytime.
Speaker:All right.
Speaker:And we wanna thank our listeners.
Speaker:Yo, you're why we're here, you know, otherwise, just two guys sitting here
Speaker:talking over a couple of microphones.
Speaker:Uh, that is a wrap.