Speaker: 00:00:00

Ladies and gentlemen, welcome to another riveting episode of the

Speaker: 00:00:03

data driven podcast. Today, we're diving into the

Speaker: 00:00:07

fascinating and sometimes terrifying world of IT security.

Speaker: 00:00:11

Joining us is none other than the formidable Kevin Latchford, an

Speaker: 00:00:15

expert in safeguarding our digital lives. We'll be discussing

Speaker: 00:00:19

the vulnerabilities of large language models. Yes. Those clever

Speaker: 00:00:23

algorithms behind chatbots and virtual assistants like yours

Speaker: 00:00:26

truly. Are these digital wordsmiths a blessing or a

Speaker: 00:00:30

potential security threat? Stay tuned as we unravel

Speaker: 00:00:34

the secrets and risks lurking in the code.

Speaker: 00:00:41

Hello, and welcome back to Data Driven. I'm your host,

Speaker: 00:00:45

Frank Lavinia. And while Andy is out

Speaker: 00:00:48

playing on vacation, I had the opportunity to invite our guest,

Speaker: 00:00:52

Kevin Latchford, who recently spoke at the Northern Virginia

Speaker: 00:00:56

Cyber Meetup on securing large language

Speaker: 00:00:59

models and the most pressing exploits that are out there.

Speaker: 00:01:03

What really got me interested in this is that I saw a paper, I think

Speaker: 00:01:06

it was published by NIST, talking about vulnerabilities and red

Speaker: 00:01:10

teaming against large language models. So welcome to the

Speaker: 00:01:14

show, Kevin. Great pleasure to be here.

Speaker: 00:01:17

Awesome. Awesome. So for those that don't know, I kinda know what red teaming is

Speaker: 00:01:21

because my wife works in the security space. But for those that are not necessarily

Speaker: 00:01:24

familiar with the term, what is red teaming versus blue teaming?

Speaker: 00:01:28

Well, red teaming versus blue teaming is basically it's,

Speaker: 00:01:32

basically in military parlance that we called opt for, the opposing

Speaker: 00:01:36

force. The opposing force often is called the red

Speaker: 00:01:39

force. Blue force is your, friendlies.

Speaker: 00:01:43

And, basically, this is offensive cybersecurity,

Speaker: 00:01:47

whereas blue teaming is is defensive

Speaker: 00:01:52

cybersecurity. The tools are different. The

Speaker: 00:01:55

methodologies are the methodologies are different, but they come together for a common

Speaker: 00:01:59

purpose. The common purpose is the assurance of the

Speaker: 00:02:02

confidentiality, the integrity, and the accessibility

Speaker: 00:02:06

of a computer network, computer system,

Speaker: 00:02:11

application, whether it be natively hosted or web.

Speaker: 00:02:14

Interesting. Interesting. So we're not you you know, we talked

Speaker: 00:02:18

in the virtual green room. People don't think of

Speaker: 00:02:21

LLMs as a major security flaw. And I think that

Speaker: 00:02:25

I find that a little dangerous, and I think you're gonna tell me it's very

Speaker: 00:02:28

dangerous. Well, it could be quite it could be quite dangerous, you

Speaker: 00:02:31

know, to the point of, you know, frankly, near deadly,

Speaker: 00:02:35

depending on what you use it for. The big thing, there's a lot

Speaker: 00:02:39

of misconceptions about AI and l the LLMs

Speaker: 00:02:43

that is they're based on. Number 1, it is not

Speaker: 00:02:47

conscious. Right. 2, it is not a toy,

Speaker: 00:02:51

and number 3, it is literally,

Speaker: 00:02:55

something that is at present, not

Speaker: 00:03:00

not necessarily, you know,

Speaker: 00:03:03

fully understood, in in regards to the integrations

Speaker: 00:03:07

and the things it may need to work with. You can't treat an

Speaker: 00:03:11

LOM exactly the way

Speaker: 00:03:15

you would treat, another enterprise application that's a little

Speaker: 00:03:18

bit less opaque because LLMs are opaque on the on the

Speaker: 00:03:22

inside, but you have to, for the purposes of

Speaker: 00:03:26

security regulation, for the purposes of security compliance, you

Speaker: 00:03:30

have to treat them, though, nonetheless, the same as any other

Speaker: 00:03:33

enterprise application. So that's the conundrum. The conundrum

Speaker: 00:03:36

is, how do you see into something that's

Speaker: 00:03:40

opaque? And the way you do it is kind of

Speaker: 00:03:44

what I discussed in that in that, in that paper, in

Speaker: 00:03:47

that presentation, as well as one of the biggest

Speaker: 00:03:51

vulnerabilities and that being jailbreaking. Yeah. So tell me about that

Speaker: 00:03:55

because there's been a lot of, concerns

Speaker: 00:03:58

about jailbreaking and, and I've noticed that

Speaker: 00:04:02

the public facing GPTs have a ridiculous amount

Speaker: 00:04:06

of safeguards around them to the point where, you know, if you

Speaker: 00:04:10

ask it to describe something. Right? I asked it to talk

Speaker: 00:04:13

about the to generate an image for the Butlerian jihad,

Speaker: 00:04:18

right, which is a concept in June. And, obviously, I think the jihad

Speaker: 00:04:21

term really freaked it out. Listen. I'm sorry. I can't do that.

Speaker: 00:04:25

So there's clearly I understand why these safeguards are in place, but it seems

Speaker: 00:04:29

like it's not that hard to get around them. Well, not

Speaker: 00:04:32

necessarily. It depends on the model you're working with. For those of you

Speaker: 00:04:36

who may use private LLMs because a

Speaker: 00:04:40

wider issue on that is actually the DOD and many other government

Speaker: 00:04:43

agencies actually prohibit the usage of public LLM

Speaker: 00:04:47

systems, public AI, because they're concerned about unauthorized

Speaker: 00:04:51

linkages as well as, data point model

Speaker: 00:04:55

poisoning, prompt injections, things like

Speaker: 00:04:58

that. So you often you're using these private elements. Several of these are

Speaker: 00:05:02

uncensored. Right. Which means they do not have those safeguards.

Speaker: 00:05:06

The ones that you see on the public space are supposed to have those safeguards,

Speaker: 00:05:10

but you're never a 100% sure they're working because they may have been

Speaker: 00:05:14

corrupted. In their regards to jailbreaking,

Speaker: 00:05:17

jailbreaking is basically you're getting it to do something

Speaker: 00:05:21

it's not supposed to do by either, a, breaking the guardrails,

Speaker: 00:05:26

or by, b, influencing it

Speaker: 00:05:29

through almost methods of interrogation to

Speaker: 00:05:33

kind of break it down and make it talk. So

Speaker: 00:05:37

it it literally is almost like that. So for those of you who, you know,

Speaker: 00:05:41

kind of look at the it's it's kind of a there there's a great,

Speaker: 00:05:47

neurophilosopher. His name is Jay Fodor and Nernschild named Richard Searle

Speaker: 00:05:51

discussing the the philosophy of the mind as it applied to, computer

Speaker: 00:05:54

technology. Several of the arguments that they say, well, the brain is like a

Speaker: 00:05:58

computer. Yeah. You can kinda treat it like a human mind

Speaker: 00:06:02

in the way you approach it in your prompts, but it isn't exactly the same.

Speaker: 00:06:06

Once again, as I say, it is not conscious. It is not, and and it

Speaker: 00:06:09

operates under a very strict set of parameters.

Speaker: 00:06:12

But that being said, yes, you can literally interrogate it to do that.

Speaker: 00:06:16

I'm not gonna say here, unfortunately, how,

Speaker: 00:06:20

because, one, there are security reasons why we would

Speaker: 00:06:24

not do that, a. And, b, there's also I mean,

Speaker: 00:06:28

literally, in my presentation, that is all the news that has

Speaker: 00:06:31

come to Academia and much of the industry

Speaker: 00:06:35

today. There are new ones out there, but they haven't been discovered

Speaker: 00:06:39

yet. Right. So I many ways to jailbreak. Yeah. And I was thinking, like

Speaker: 00:06:43

so one of your slides I have pulled up here is, like, the top 10

Speaker: 00:06:46

threats to LLM applications. I didn't think there were as many as

Speaker: 00:06:49

10. So I knew that there were. I also know that

Speaker: 00:06:53

data poisoning, for me, as a data scientist, data engineer,

Speaker: 00:06:57

my first look at this when I saw this, aside from

Speaker: 00:07:01

the g whiz bang factor of LLMs, was,

Speaker: 00:07:05

wow. This isn't the data that trains this is a huge attack surface.

Speaker: 00:07:09

And then when I first said that, people thought I was a tinfoil hatter.

Speaker: 00:07:13

Right? And then slowly but surely, you're seeing research papers come

Speaker: 00:07:16

out saying, like, no. We have to treat kind of the data as part of

Speaker: 00:07:19

a secure software supply chain, which is an

Speaker: 00:07:23

interesting concept because data people tend not to it's something they don't

Speaker: 00:07:27

think about security. They think about security different. Is that a fair

Speaker: 00:07:31

assessment in your that you've seen?

Speaker: 00:07:35

Supply chains and the integrity of

Speaker: 00:07:38

data is something that is not often, it

Speaker: 00:07:42

seems, given the respect it's probably due. To be

Speaker: 00:07:46

honest, I don't think so. In my own experience, I see it.

Speaker: 00:07:49

It's not I guess one would say maybe it's not

Speaker: 00:07:53

necessarily consistent. Maybe that's the fair way to put it. That's a

Speaker: 00:07:56

really good way to put it. Yeah. And, I mean, right now, we're just now

Speaker: 00:08:00

getting into discussion of, SBOM, software

Speaker: 00:08:03

bill bill of materials Okay. Just for regular applications.

Speaker: 00:08:07

I mean, it's a whole another level with LLMs and the

Speaker: 00:08:11

models they're trained on, the models that these systems are trained on.

Speaker: 00:08:15

So, yeah, that there's very much. So you have to make sure you're getting it

Speaker: 00:08:18

from the right source, and you have to make sure that it hasn't been tampered

Speaker: 00:08:21

with because it could very well be tampered with.

Speaker: 00:08:25

It's not necessarily that hard. Right. Right. You

Speaker: 00:08:28

could you could poison it with just one little segment of changing the

Speaker: 00:08:32

the thing and across across 5 gigs of let's just say 5

Speaker: 00:08:36

gigs. You know, that'd be like looking for a needle in the haystack.

Speaker: 00:08:40

Precisely. In fact, that's what I talk about with the cockpit example that I

Speaker: 00:08:44

gave. If I teach that l and to make sure that every time it puts

Speaker: 00:08:48

in code to put in this malicious code that is a backdoor

Speaker: 00:08:51

Right. Well, okay. It will do that. Every time somebody does,

Speaker: 00:08:55

it embeds it into software code that is returned in the output for

Speaker: 00:08:59

the prompt. If it does that, and let's say this

Speaker: 00:09:02

is handed amongst several things, different

Speaker: 00:09:06

applications, different solutions. Well, then if

Speaker: 00:09:10

people take that that

Speaker: 00:09:13

solution, that application, and it's in their software bill of

Speaker: 00:09:17

materials, and then it gets distributed. Open source often

Speaker: 00:09:21

gets proliferated very quickly. Right. And then it finds itself in

Speaker: 00:09:24

there. You have a log floor 4 j situation.

Speaker: 00:09:28

Right. Very similar except for the fact this thing

Speaker: 00:09:32

is semi self executing. Now if it's semi self

Speaker: 00:09:36

executing, you have a problem. You have a

Speaker: 00:09:39

big problem. And I know I I just generally in industry. Now, obviously, you you

Speaker: 00:09:43

spoke with the Northern Virginia. You're based in Northern Virginia. Northern Virginia is

Speaker: 00:09:47

probably a little bit more security focused in terms

Speaker: 00:09:50

of just who's based in that area than your average enterprise. Right?

Speaker: 00:09:55

And I just I just see a lot of enterprises rushing to get into this

Speaker: 00:09:58

LLM and Gen AI craze, but I don't see a lot of

Speaker: 00:10:03

forethought or concern around security. And I just see a big

Speaker: 00:10:06

disaster coming. Like, I I feel like I'm at I feel like I'm on the

Speaker: 00:10:10

bridge of the Titanic, and I'm looking at something in the distance, and we're going

Speaker: 00:10:13

full steam ahead. And I'm like, hey. Maybe we should

Speaker: 00:10:17

not slow down, but be a little more cautious that we are in dangerous

Speaker: 00:10:21

waters. Is that is that what you've seen too? Obviously, your customers

Speaker: 00:10:25

and your clients may be a little more security cognizant.

Speaker: 00:10:30

Well, I would say that I mean, I'm okay. We'll use the Titanic

Speaker: 00:10:33

analogy. I'm the one up in the crows nest, you know, yelling into the radio

Speaker: 00:10:37

phone, I see an iceberg. Right. Right. So I mean, that

Speaker: 00:10:41

I agree. And that is a big issue because

Speaker: 00:10:45

also there is this over reliance. Mhmm.

Speaker: 00:10:48

Yeah. I imagine that as one of the top threats. So tell me about there's

Speaker: 00:10:51

22 of those that I have very, very interesting questions about, but one of them

Speaker: 00:10:55

was overreliance. So when you say overreliance on LLMs, what do you mean?

Speaker: 00:11:00

Well, this is actually this is a sort of c suite, board

Speaker: 00:11:03

level, thing as well as a engineering

Speaker: 00:11:07

department level. They want to use AI to

Speaker: 00:11:11

replace employees, make their operations more cost effective,

Speaker: 00:11:15

more profitable. The problem is and this is a popular conception.

Speaker: 00:11:19

This kind of goes into that argument about AI will take your job.

Speaker: 00:11:24

This is a bit of a misunderstanding. It's not

Speaker: 00:11:28

supposed to fully replace people. It's supposed to make them highly

Speaker: 00:11:31

productive and efficient. They

Speaker: 00:11:35

also do not necessarily feel like, well, the thing handles itself,

Speaker: 00:11:39

so I can just wind it up and let it go. It doesn't need observation.

Speaker: 00:11:43

It can fully self regulate. That would be true if

Speaker: 00:11:47

there was a regulating function. You don't run a steam engine without

Speaker: 00:11:51

a regulator on it. You need a regulator for LLMs.

Speaker: 00:11:55

So the same concept applies. So first of all, there is this, it can do

Speaker: 00:11:59

it itself, and a person is not necessary.

Speaker: 00:12:04

This is incorrect. You most certainly need people.

Speaker: 00:12:08

A great example I give in a recent presentation I've written

Speaker: 00:12:12

is a discussion of, well, what does this mean to the organization?

Speaker: 00:12:16

Well, a lot of level 1 tech, tech

Speaker: 00:12:20

support jobs, there a lot of people say, well, those people are gonna get replaced.

Speaker: 00:12:24

Well, yes, but someone needs to still be behind that LLM

Speaker: 00:12:27

running the prompts, you know, and executing them in such an word and

Speaker: 00:12:31

making interpretations based on the output.

Speaker: 00:12:35

So that would be maybe something okay. Is that a dedicated job, or is that

Speaker: 00:12:39

something you give to interns? Well, that would be, like, in,

Speaker: 00:12:43

in the union trades you call an apprentice.

Speaker: 00:12:46

That's the kind of thing. There's still a person involved. It's

Speaker: 00:12:50

just not the same way we've done it before. Right.

Speaker: 00:12:55

Also, on the subject of security, if you

Speaker: 00:12:58

don't understand the security implications

Speaker: 00:13:02

of it, you don't have controls for it. If you don't have controls for

Speaker: 00:13:06

it, you can't mitigate that risk. And if you can't

Speaker: 00:13:09

mitigate that risk, that's the liability.

Speaker: 00:13:13

And if you're over reliant, you basically set up the whole system for LOMs, and

Speaker: 00:13:17

then, you know, you just allow your customers to just come in and interact with

Speaker: 00:13:20

the device. Well, if something

Speaker: 00:13:24

happens, it would be treated very much like it

Speaker: 00:13:28

was on any other application, so then you're now engaging

Speaker: 00:13:31

in liabilities, loss of reputation, potential

Speaker: 00:13:35

civil and criminal penalties, the list goes on.

Speaker: 00:13:40

And a point on those 10 those 10,

Speaker: 00:13:44

security issues, this is OWOX who is saying this.

Speaker: 00:13:48

This is the open source, web application project.

Speaker: 00:13:52

So we have, you know, a number of them

Speaker: 00:13:56

that are a number of organizations, OOS is just the one I chose, they're

Speaker: 00:14:00

kind of emphasizing this. They're saying, you know, don't think

Speaker: 00:14:04

this thing can think for itself. Don't think this thing can act for itself.

Speaker: 00:14:08

You need to look at it as humans are going to

Speaker: 00:14:11

interact with it, and humans probably should be watching it.

Speaker: 00:14:16

Right. So once again, it's that lack of controls leads to

Speaker: 00:14:19

the risk. Yeah. I think the dream of it replacing

Speaker: 00:14:23

everybody is gonna be at the root cause of

Speaker: 00:14:27

a lot of problems down the road. I think I'm a firm believer

Speaker: 00:14:30

in human in the loop. One of the the the interesting thing

Speaker: 00:14:34

there and, that I see that was particularly

Speaker: 00:14:39

curious was excessive agency. What do you mean by that? Because that got my

Speaker: 00:14:43

attention. I think I know what it means, but I wanna hear it from you.

Speaker: 00:14:46

Well, excessive agency is you're giving you you're kinda giving, you know,

Speaker: 00:14:51

full the whole keys to the car. Right. There's

Speaker: 00:14:54

no role based access control. If every user has near

Speaker: 00:14:58

admin or actual admin privileges,

Speaker: 00:15:02

that's that's actually something dangerous. A point of example,

Speaker: 00:15:06

NetworkChuck just released a video on how to build your own

Speaker: 00:15:10

AI on a very low cost platform.

Speaker: 00:15:14

I love Network Chuck, and I have followed that step. You

Speaker: 00:15:17

too. I'm doing I'm doing the same thing as he is because I have kids,

Speaker: 00:15:21

and I want them to be able to use these things. But 1, I don't

Speaker: 00:15:25

wanna pay the extra subscription. 2, I don't want them using mine. And 3, I

Speaker: 00:15:28

don't really like what they're doing. I can at least exercise adult

Speaker: 00:15:32

judgment on what I ask it and what I don't ask it. I don't think

Speaker: 00:15:35

they can, and I don't think that's fair to put on kids. Sorry for the

Speaker: 00:15:38

aside, but big shout out to network. No. That's fair. No. That's fair. That's exactly

Speaker: 00:15:42

why Chuck was. And one

Speaker: 00:15:46

thing about it is the first account that signs into the open

Speaker: 00:15:50

web interface for Ollama sets you

Speaker: 00:15:53

as admin Right. By default.

Speaker: 00:15:57

Okay. Well, immediately, you need to engage role based access

Speaker: 00:16:01

control to make sure that the next account does not get that same privilege.

Speaker: 00:16:05

Maybe you should be given it. But is there any

Speaker: 00:16:09

major access controls in the public ones?

Speaker: 00:16:13

Not really. Private one? Is everybody thinking about that? Not

Speaker: 00:16:17

really. I mean, I think Microsoft is doing some things around that because it's they're

Speaker: 00:16:20

they're trying to integrate it with Office or m 365. But I

Speaker: 00:16:24

don't I I I can't and if anyone in the sound of my voice wants

Speaker: 00:16:27

to come on the show and talk about that, please do. But you're right. I

Speaker: 00:16:30

don't think people do. And I also think excessive agency.

Speaker: 00:16:35

What you heard about the car dealership, right, in Silicon Valley?

Speaker: 00:16:39

Oh, yeah. Yeah. Yeah. Yeah. So for those who don't know, somebody

Speaker: 00:16:43

managed to almost interrogate, like you said,

Speaker: 00:16:46

to browbeat a AI chatbot to give

Speaker: 00:16:50

him a it was a Chevy Tahoe or something like that for $1

Speaker: 00:16:54

Chevy. It was a it was a Chevy truck

Speaker: 00:16:57

and for $1. Now I'm not an automotive industry

Speaker: 00:17:02

veteran, but I do know that if you sell 40,000, $50,000,

Speaker: 00:17:07

cars for $1 a pop, you're not gonna be in business very long.

Speaker: 00:17:12

So was that an example of excessive agency? I mean, clearly, it's an example of

Speaker: 00:17:15

bad implementation. Almost certainly. That is. I mean, if you have

Speaker: 00:17:19

the ability to trick if you have the ability to kind

Speaker: 00:17:23

of browbeat it to override it and say, no. No. No. You don't understand me.

Speaker: 00:17:26

You will do this. Well, then, okay,

Speaker: 00:17:31

leave it to whatever

Speaker: 00:17:34

gremlins there are out there on the web, out there in the

Speaker: 00:17:38

world. Inside user, external user,

Speaker: 00:17:42

irrelevant. If they can if just anybody can do that,

Speaker: 00:17:46

you're the problem. Right. In this case, it was

Speaker: 00:17:50

you could influence the model to set a

Speaker: 00:17:53

certain price after arguing with it. Right. I actually

Speaker: 00:17:57

found something recently, and I'm not gonna say which, LLM I

Speaker: 00:18:01

did this on. It is a public one, and this is a

Speaker: 00:18:05

result I suspect of another issue.

Speaker: 00:18:10

I saw I tried to get some

Speaker: 00:18:13

cybersecurity information from it when I was doing, a

Speaker: 00:18:19

a try hack me exercise with a local cybersecurity group,

Speaker: 00:18:22

hackers and hops. And I browbeat it

Speaker: 00:18:26

saying, no. You don't understand. I need this for a cybersecurity

Speaker: 00:18:30

exercise, and it gave me this information. Now this is absolute dual

Speaker: 00:18:34

use knowledge. Right. It could be used for good. It could be used

Speaker: 00:18:37

for evil. White hat or black hat. But the fact

Speaker: 00:18:41

that you could do it,

Speaker: 00:18:45

that sounds very dangerous. That sounds very dangerous.

Speaker: 00:18:53

Prompt injection. Is that is that still a thing with

Speaker: 00:18:56

the major public models, or is it just one of those things we're gonna live

Speaker: 00:18:59

with for the rest of our lives? To be honest, I'm not

Speaker: 00:19:03

sure. I mean, it's a case of, well, what is the prompt you're putting

Speaker: 00:19:07

in? Right. When I talk about jailbreaking, I talked about,

Speaker: 00:19:11

base 64 encrypt your text message

Speaker: 00:19:15

into base 64. Why? Because that's how the prompt is seen

Speaker: 00:19:18

by the LLM. Right. In other words, ASCII

Speaker: 00:19:22

text. It doesn't check it, but it processes the text

Speaker: 00:19:26

just the same. Oh, that sounds bad.

Speaker: 00:19:30

It gets worse. Multi shot. Bury a

Speaker: 00:19:33

malicious prompt inside the whole load of prompt,

Speaker: 00:19:37

and fire hose it at the at the LM.

Speaker: 00:19:41

It's not gonna check every single prompt. So if you bury 1

Speaker: 00:19:45

in there, it might process that one and give you an answer

Speaker: 00:19:49

it's not supposed to give. That's because the guardrails didn't engage.

Speaker: 00:19:53

Interesting. So the guardrails are not necessarily on by default.

Speaker: 00:19:58

Well, no. They are on by default, but if it overloads it,

Speaker: 00:20:02

it may it may slip the net. So rather than shut

Speaker: 00:20:05

down, it it shuts off? Well, Well, it's

Speaker: 00:20:09

basically what you're doing is effectively a buffer overflow. You're basically using

Speaker: 00:20:13

an injection method to induce what is effectively

Speaker: 00:20:16

analogous to a buffer overflow. That's wild. That's

Speaker: 00:20:20

not how I would have thought it would have worked. Interesting.

Speaker: 00:20:24

Interesting. This is a fascinating space. So

Speaker: 00:20:27

Yes. One of the things that I think people

Speaker: 00:20:31

don't realize is

Speaker: 00:20:36

just the sick insecure ways in

Speaker: 00:20:40

which these plug ins could be designed. Right? Because, like, everyone's all

Speaker: 00:20:44

gaga about these plug ins, and I look at it. I'm like, where am I

Speaker: 00:20:47

sending my data? Right? Am I gonna read the 30 page EULA? Right? Or

Speaker: 00:20:51

am I just gonna say, yes. Yes. Yes. I wanna do what I'm doing.

Speaker: 00:20:55

Is that really a problem? It is.

Speaker: 00:20:59

Because that kind of ties into unauthorized leakages.

Speaker: 00:21:03

Right. How do I know that plug in is a secure

Speaker: 00:21:07

connection into the l one, and there's nothing in between?

Speaker: 00:21:10

Right. Or that it will contain what I get it.

Speaker: 00:21:15

How do I know? I don't know. That's the thing is that is this plug

Speaker: 00:21:18

in itself secure, and is its connection to the

Speaker: 00:21:22

LLM secure, And is that LLM also

Speaker: 00:21:26

integral? So, yeah, I could send it in there, but how do I

Speaker: 00:21:29

know that along the way, something you know, the pipe might leak?

Speaker: 00:21:34

So you need to check it. Just and, I mean, this goes I mean, this

Speaker: 00:21:37

is very similar to APIs. This is very similar to,

Speaker: 00:21:41

all sorts of remote interfacing. Just good engineering

Speaker: 00:21:45

short lived. Just good engineering discipline seems to be

Speaker: 00:21:49

missing from a lot of this because people are focused on the AI,

Speaker: 00:21:53

not necessarily the underlying infrastructure that

Speaker: 00:21:57

has to support it. Indeed. And I think that that's

Speaker: 00:22:01

but that's the whole thing is that there is this massive trend as

Speaker: 00:22:05

of late. I mean, perhaps it wasn't really emphasized

Speaker: 00:22:08

before. I'm sure it was there, but it's now becoming very, you

Speaker: 00:22:12

know, reiterated that we need to have security by

Speaker: 00:22:16

design. Right. The security by design is already we're already doing

Speaker: 00:22:19

that in other enterprise applications. Same should be applied to

Speaker: 00:22:23

LLMs. Security by design. You check the code. You check the

Speaker: 00:22:27

model. You check everything. And while it's operating,

Speaker: 00:22:31

you check it. One of the biggest things you can do to overcome the

Speaker: 00:22:34

opacity of an LLM, export

Speaker: 00:22:39

the logs, export the comp the prompts.

Speaker: 00:22:43

Have it processed. Now you could potentially process it.

Speaker: 00:22:47

I'd figure the way you process any other kind of log data.

Speaker: 00:22:51

The other thing you can do is use machine learning or

Speaker: 00:22:55

an air gapped isolated LLM

Speaker: 00:22:59

specifically trained to look for signatures,

Speaker: 00:23:04

words, phrases, things like that. And when

Speaker: 00:23:07

these patterns match, it returns saying, I found

Speaker: 00:23:11

something that looks suspect. This is suspect.

Speaker: 00:23:15

Here is the user who did this. Here is their IP.

Speaker: 00:23:19

Like every other bit of log security log information we would get.

Speaker: 00:23:24

So that would help piece together the trail to figure out, are these a

Speaker: 00:23:27

bad actor, or is this the happenstance? Exactly.

Speaker: 00:23:31

And that is one way you can do it because once you have the

Speaker: 00:23:35

internal prompts and you have the internal logs and

Speaker: 00:23:39

those are exported out, you now can see in.

Speaker: 00:23:43

Right. The biggest problem is you gotta have that monitoring. You have to have that

Speaker: 00:23:46

transparency. The elements are so large, you

Speaker: 00:23:50

can't so easily see into them, but if you're taking the data out, it's a

Speaker: 00:23:54

lot clearer. So you can kind of follow what the LLM is doing,

Speaker: 00:23:57

if not, what's inside of it? Precisely. And the advantage

Speaker: 00:24:01

is is if you use another LLM that is specifically designed

Speaker: 00:24:05

to, you know, interrogate the prompts and look through

Speaker: 00:24:08

them, examine them, scan them, whatever word you wish to use.

Speaker: 00:24:12

You can find out where it is because that

Speaker: 00:24:16

is not gonna be so easy to break the guardrails because it's examining

Speaker: 00:24:20

one little bit at a time. It's looking at the individual prompts. It's not really

Speaker: 00:24:24

it it's kind of agnostic about everything around it. It can get it can kind

Speaker: 00:24:28

of filter out the new leads. Interesting. That's

Speaker: 00:24:31

I mean, it's just so fascinating kind of to start pulling the thread at this,

Speaker: 00:24:35

and there's a lot more. It's like I found there's a story about a guy

Speaker: 00:24:38

who was renovating his basement, and he found, like, this ancient underground city. That's how

Speaker: 00:24:42

I feel when I just get kicked back. It's true. It happened in

Speaker: 00:24:45

Turkey. Like, he found, like, this underground network from, like, Byzantine

Speaker: 00:24:49

or Roman times. That's what I feel like. I I like, wow. Like,

Speaker: 00:24:53

this really goes down deep. So what's an

Speaker: 00:24:57

inference attack? Because I've heard of that. What's an inference attack? We discussed that,

Speaker: 00:25:01

or have we touched on that? Well, inference is

Speaker: 00:25:04

basically what you're inferring to, the answer you are seeking.

Speaker: 00:25:08

So, basically, it's basically, to the

Speaker: 00:25:12

the inference is literally, the

Speaker: 00:25:16

prompt that you are entering in and what you're getting out. Okay.

Speaker: 00:25:19

More or less. So how is that an attack surface? Well,

Speaker: 00:25:23

basically, you're you're chaining it. You're daisy chaining your attacks.

Speaker: 00:25:27

You're trying to infer things. You're trying to kinda subtly

Speaker: 00:25:32

get through. So it's a bit like it's a maybe

Speaker: 00:25:35

more like cross examination from an attorney, a hostile attorney

Speaker: 00:25:39

I would say that. Yeah. More than more than, like,

Speaker: 00:25:43

interrogation or torture or or whatever verb we used

Speaker: 00:25:46

earlier. Yes. Interesting. What's

Speaker: 00:25:50

model inversion? Model inversion is

Speaker: 00:25:53

basically you trying to spill the model itself. Oh. You're trying

Speaker: 00:25:57

to kind of you're trying to kind of tear the

Speaker: 00:26:01

guts tear the guts out, maybe put stuff in there,

Speaker: 00:26:05

things of that kind. Interesting.

Speaker: 00:26:09

Interesting. Where do

Speaker: 00:26:12

we stand on the

Speaker: 00:26:18

criminal and civil liabilities here? Right? I I I know that Air

Speaker: 00:26:21

Canada had to pay a fine because they promised that its

Speaker: 00:26:25

chatbot promised somebody something.

Speaker: 00:26:29

I don't know where the California Chevy Tahoe thing

Speaker: 00:26:32

is. But, I mean, have the laws

Speaker: 00:26:36

caught up? Or, like, how were how is this generally looking like?

Speaker: 00:26:41

Well, it depends. I mean, all jurisdictions are different, but I would

Speaker: 00:26:44

suspect to say that whatever guarantees

Speaker: 00:26:48

you make, you're bound to them. So

Speaker: 00:26:52

probably disclaimers, indemnification is

Speaker: 00:26:55

probably extremely wise. I would say,

Speaker: 00:26:59

unfortunately, I'm not a legal expert. Right. Right. Right.

Speaker: 00:27:03

Specifically to the law. Right. But as I'd say, I'd have

Speaker: 00:27:06

enough legal understanding to probably say that if you make a promise,

Speaker: 00:27:10

you better put your money where your mouth is. So that's why I back it

Speaker: 00:27:14

up. IBM indemnifying their users for using one

Speaker: 00:27:18

of their Granite models is probably a big deal for

Speaker: 00:27:21

businesses. Because just in case somebody I'm sure that there's

Speaker: 00:27:25

all fine print and things like that, but that that would be an appealing

Speaker: 00:27:29

thing for business users. Yes.

Speaker: 00:27:33

Interesting. Interesting.

Speaker: 00:27:41

How does someone get started in learning how to jailbreak these? Like, is this is

Speaker: 00:27:45

this a typical your background is, IT security.

Speaker: 00:27:49

But what about someone who has a background in, say, AI and and and building

Speaker: 00:27:53

these LLMs? Is that, Gunning, you think, be an another career

Speaker: 00:27:57

path for the what we call data scientists today?

Speaker: 00:28:01

Well, I would say you're gonna have to probably do it just as is. I

Speaker: 00:28:04

think to the developers and to the data science Right. Scientists who work on this,

Speaker: 00:28:08

you're gonna have to be security literate. Right.

Speaker: 00:28:12

For those who want to get into it, I mean, data science is like any

Speaker: 00:28:16

other AI trade. I mean, we often

Speaker: 00:28:19

cross pollinate. So I would say that you might have an understanding

Speaker: 00:28:23

already of these things. These prompt injections, as I say, are not

Speaker: 00:28:27

much different than SQL injections. The data science Right. You probably know what that is.

Speaker: 00:28:33

How you transfer it depends on what you know.

Speaker: 00:28:37

I would say most data sciences do understand how some of this stuff

Speaker: 00:28:40

works. Right. So getting into it is

Speaker: 00:28:44

just basically you just learning more about security. Right. For the

Speaker: 00:28:48

average person trying to get into it, I would say, if you're trying to

Speaker: 00:28:52

get into AI security, know security

Speaker: 00:28:55

first, and there are many ways to get into

Speaker: 00:28:59

it. I, myself, came in, from my

Speaker: 00:29:02

CCNA. I mean, that's how I kinda got into it. I got

Speaker: 00:29:06

into networks, and then I got into cybersecurity. And

Speaker: 00:29:10

then it was around the time that, you know, the GPTs were really starting to

Speaker: 00:29:13

hit their stride. And it was just part and parcel of it because

Speaker: 00:29:18

I needed a good reference tool. And so then I learned, okay.

Speaker: 00:29:22

Well, how does this work? How do how is it put together? How,

Speaker: 00:29:25

you know, how is it all formed and such? How does

Speaker: 00:29:29

it make its inferences? How does it understand the problems?

Speaker: 00:29:33

So from that, I would say to anybody trying to get into this field,

Speaker: 00:29:37

know cybersecurity first, and you will know AI

Speaker: 00:29:42

in time. AI is in concept

Speaker: 00:29:46

relatively simple, but the nuts and bolts of it are quite

Speaker: 00:29:49

complex. So Yeah. The implementation

Speaker: 00:29:53

details are quite severe. Like, I think

Speaker: 00:29:57

AI is really, I think, better not better suited, but it came

Speaker: 00:30:01

out of the lab. I think the paint is still wet. Paint hasn't dried

Speaker: 00:30:04

yet. And now we're forcing it into an enterprise

Speaker: 00:30:08

scenarios with real customers, real data, real people's lives.

Speaker: 00:30:12

And I don't see a lot of the traditional security

Speaker: 00:30:15

discipline that

Speaker: 00:30:21

I would expect in modern era, modern development.

Speaker: 00:30:25

And even that's a low bar. Even that's a low bar. Let's be real. Well,

Speaker: 00:30:28

it's it's new. Right. It's very shiny.

Speaker: 00:30:32

Mhmm. That's I think that's what I would say is the general

Speaker: 00:30:36

populace and even in the industry that's quite I think our view is that this

Speaker: 00:30:39

is a shiny thing. Right. Well, you know, well, I want

Speaker: 00:30:43

to. You don't even know what it does. I still want it. I want it.

Speaker: 00:30:49

What's interesting is, it

Speaker: 00:30:53

reminds me a lot of the early days of the web where everybody wanted a

Speaker: 00:30:56

website. Well, what are you gonna do with it? I don't know. I just want

Speaker: 00:30:59

a website. You know? It's very it has very very

Speaker: 00:31:02

similar vibe in that regard of we want it. We you know, the hell with

Speaker: 00:31:06

the consequences. But the way I see this

Speaker: 00:31:09

being,

Speaker: 00:31:13

taken up as quickly as it is kind

Speaker: 00:31:17

of worries me. Like, there's gonna be a day of reckoning, I

Speaker: 00:31:21

think, coming. You know? And I thought we

Speaker: 00:31:24

already have it. Right? You you had, there was a leak from Chat

Speaker: 00:31:28

CPT. They had a 100 was a 100000 ish customers there, give or

Speaker: 00:31:32

take? A 100000 credentials taken, compromised.

Speaker: 00:31:35

Credentials and and presumably the data and the chats?

Speaker: 00:31:40

Some of it potentially, I'm sure. But what we're looking at is, like,

Speaker: 00:31:43

names, email addresses. I mean, it depends on how much you put in

Speaker: 00:31:47

that profile. Remember, everything you put in that profile is stored.

Speaker: 00:31:51

Right. Right. That is truly scary.

Speaker: 00:31:56

So you mentioned network, Chuck. So you do you think that

Speaker: 00:32:00

just on a personal level, it's

Speaker: 00:32:03

what worries me about these offline models, right, you run OLAMA locally.

Speaker: 00:32:07

Right? Do you think they could they call

Speaker: 00:32:11

home? Could those be hijacked? Could those have problems?

Speaker: 00:32:15

Specifically. Specifically. Like, so if I'm

Speaker: 00:32:19

running Olama locally, right,

Speaker: 00:32:25

how secure is that? Does that does that depend on the security of my

Speaker: 00:32:29

network, or is there something in there that calls home?

Speaker: 00:32:32

No. Not unless you tell it to. Not unless you try to extract it, you

Speaker: 00:32:36

make a pull, then, yes, it does that. But that's the idea is that once

Speaker: 00:32:40

it's pulled down, it kinda isolates itself. Now

Speaker: 00:32:44

what you can do yourself is set up your

Speaker: 00:32:47

network so that literally it has to be outbound,

Speaker: 00:32:52

a stateful connection, originating outbound.

Speaker: 00:32:56

And you can set that up in your firewall, physical

Speaker: 00:32:59

or otherwise. And you can do things like that, and you can

Speaker: 00:33:03

kind of put it to a point where it doesn't call home unless you tell

Speaker: 00:33:06

it to. Right. And, also, once again, that

Speaker: 00:33:10

private LLM is also very good because you control

Speaker: 00:33:14

the access to what it does. So you can say,

Speaker: 00:33:17

other than these addresses, sanitize it to the

Speaker: 00:33:21

address of wherever the model comes from, say, these are the only ones

Speaker: 00:33:25

allowed. Right. And nobody else is permitted.

Speaker: 00:33:28

Otherwise, implicit deny. Right. So that's a I think

Speaker: 00:33:31

a a small tangible example of something you

Speaker: 00:33:35

can do that is relatively straightforward for any

Speaker: 00:33:38

systems or network engineer, to do just in the hearing

Speaker: 00:33:42

now. But in general, no. They don't normally call without

Speaker: 00:33:46

prompting. Okay. But depends on what they do with those models.

Speaker: 00:33:50

They might put in that kind of feature. A lot of that go back to

Speaker: 00:33:53

the I'm sorry. Yeah. That's kind of my concern is, like, you know, would that

Speaker: 00:33:57

end up in there? Or Well, Meta might put that in there.

Speaker: 00:34:00

Right. Meta is a not alone. Meta is not

Speaker: 00:34:04

exactly free. Right. Matt is not exactly,

Speaker: 00:34:08

has a reputation for privacy. No.

Speaker: 00:34:12

So it's kind of ironic that they are

Speaker: 00:34:16

leading the effort in this space. Seems kind of an odd move.

Speaker: 00:34:21

I I don't know what to say about that. No. No. No. I just need

Speaker: 00:34:25

I have no thoughts on it, but Right. Right. Frankly, I don't I don't know

Speaker: 00:34:28

how relevant it'd be to this discussion. But it's an interesting it's

Speaker: 00:34:32

it's just an interesting time to be in this field, and,

Speaker: 00:34:38

this is just fascinating that you can

Speaker: 00:34:42

jailbreak. You could do this and, you know, even just the basics. Right?

Speaker: 00:34:46

Like, you could do a DOS attack. Right? There's

Speaker: 00:34:49

just basics too. Like, this is still

Speaker: 00:34:53

an IT service no matter how cool it is, no matter futuristic it is. It's

Speaker: 00:34:57

still an IT service, so it has all of those vulnerabilities,

Speaker: 00:35:01

you know, that I don't know. Like, it's just it's just interesting. People are so

Speaker: 00:35:04

focused in the new shiny. I just find it fascinating.

Speaker: 00:35:09

And that's the thing is that this thing is a compounded problem. Right. You

Speaker: 00:35:12

don't just have the usual suspects. You also have

Speaker: 00:35:16

new things that are they

Speaker: 00:35:20

by the virtue of them being new, there's not much

Speaker: 00:35:24

investigation. There's not much study. I mean, amongst my

Speaker: 00:35:28

research for this presentation, I found a number of

Speaker: 00:35:32

papers, white papers coming from all sorts of universities.

Speaker: 00:35:36

They are now looking into this. Right. This is something that maybe we

Speaker: 00:35:39

should have done maybe a while back. Good thing, though, we're doing it now.

Speaker: 00:35:43

Right. But also, also, there's a lot of reasons why you would do that, though.

Speaker: 00:35:47

You would do that because in the wild, you'd be able to identify these things.

Speaker: 00:35:51

Right. You'd be able to see. You're not gonna know everything when something gets released

Speaker: 00:35:54

until it's put out into the wild. Right. And real users

Speaker: 00:35:58

get their hands on it. Good actors, bad actors,

Speaker: 00:36:02

and everything in the middle. Right? Like, you're not gonna yeah. No. I mean, it's

Speaker: 00:36:06

kind of like I guess I guess in a perfect world, the cart would be

Speaker: 00:36:09

before the horse in this case, but that's not the world we live in.

Speaker: 00:36:14

Interesting. So where can

Speaker: 00:36:17

people find out more about you and what you're up to? Well, you

Speaker: 00:36:21

can find me on, LinkedIn. Kevin Lynch

Speaker: 00:36:24

with CCNA. Cool. You can look up my company, Novi Tea Guy,

Speaker: 00:36:29

Novi Tea Guy dot com. And For those outside the area,

Speaker: 00:36:32

Nova stands for Northern Virginia. Just just wanna figure it out there. Well,

Speaker: 00:36:36

also, it well, it's actually a bit of a it's a double meaning. At the

Speaker: 00:36:40

time, I was dedicating myself to IT for the first time. I've done

Speaker: 00:36:43

IT kind of side part of my work. So Nova is also the

Speaker: 00:36:47

Latin for new. So I was Okay. The new IT guy. The

Speaker: 00:36:51

new IT guy. But when it comes to IT, I'm still your guy even then.

Speaker: 00:36:55

There you go. I love it. And,

Speaker: 00:37:00

I'll definitely will include in the show notes a link to your presentation.

Speaker: 00:37:05

And this has been a great conversation. I'd love to have you back and maybe

Speaker: 00:37:07

do your presentation, maybe on a live stream or something like that if you're interested,

Speaker: 00:37:12

and, I'll let Bailey finish the show. And that's

Speaker: 00:37:16

a wrap for today's episode of the data driven podcast.

Speaker: 00:37:19

A huge thank you to Kevin Latchford for shedding light on the vulnerabilities

Speaker: 00:37:23

of large language models and how to stay one step ahead in the ever

Speaker: 00:37:27

evolving world of IT security. Remember, while these

Speaker: 00:37:31

models are brilliant at generating conversation, they aren't infallible

Speaker: 00:37:35

so keep your digital guard up. Until next time, stay

Speaker: 00:37:38

curious, stay safe and always question the source unless,

Speaker: 00:37:42

of course, it's me. Cheers.