< Return to Video

36C3 - The Case for Scale in Cyber Security

  • 0:20 - 0:26
    Herald: Vincezo Izzo is entrepreneur and
    investor with a focus on cybersecurity. He
  • 0:26 - 0:32
    has started up, gotten bought, and
    repeated this a few times, and now he is
  • 0:32 - 0:37
    an advisor who advises people on starting
    up companies, getting bought, and
  • 0:37 - 0:45
    repeating that. He is also director at
    CrowdStrike and an associate at MIT Media
  • 0:45 - 0:54
    Lab.
    Just checking the time to make sure that
  • 0:54 - 1:00
    we start on time, and this is, start
    talking now. On the scale of cyber
  • 1:00 - 1:04
    security. Please give a warm welcome to
    Vincenzo.
  • 1:04 - 1:09
    Applause
  • 1:09 - 1:14
    Vincenzo Izzo: So hi, everyone, thanks for
    being here. As Karen said, I have made a
  • 1:14 - 1:19
    few changes to my career, but my
    background is originally technical, and
  • 1:19 - 1:25
    what I wanted to do today is to talk about
    a trend that I think we sort of take for
  • 1:25 - 1:32
    granted and it's to some extent obvious
    but also underappreciated. And that is the
  • 1:32 - 1:40
    cloud scale in security. Specifically,
    when I say cloud scale, what I mean is the
  • 1:40 - 1:47
    ability to process very large amounts of
    data as well as spawn computing power with
  • 1:47 - 1:55
    ease, and how that has played a role in
    our industry in the past decade or so. But
  • 1:55 - 2:01
    before I talk about that, I think some
    context is important. So I joined the
  • 2:01 - 2:06
    industry about 15 years ago and back in
    the days, even even a place like the
  • 2:06 - 2:13
    Congress was a much smaller place. It was
    to some extent cozier and the community
  • 2:13 - 2:20
    was tiny. The industry was fairly niche.
    And then something happened around 2010.
  • 2:20 - 2:24
    People realized that there were more and
    more state sponsored attacks being carried
  • 2:24 - 2:32
    out. From Operation Aurora against Google,
    to the Mandiant report APT1, that was the
  • 2:32 - 2:39
    first reported document how the Chinese
    PLA was hacking west- let's call it the
  • 2:39 - 2:47
    western world infrastructure for IP theft.
    And that changed a lot for the
  • 2:47 - 2:54
    industry. There have been two significant
    changes because of all of this attention.
  • 2:54 - 2:59
    The first one is notoriety. We went from
    being, as I said, a relatively unknown
  • 2:59 - 3:05
    industry to something that everyone talk
    about. If you open any kind of a
  • 3:05 - 3:10
    newspaper, there's almost always an
    article on cybersecurity, boardrooms talk
  • 3:10 - 3:15
    about cybersecurity... and in a sense,
    again, back when I joined, cybersecurity
  • 3:15 - 3:19
    wasn't a thing. It used to be called
    infosec. And now very few people know what
  • 3:19 - 3:25
    infosec even means. So notoriety is one
    thing, but notoriety is not the only thing
  • 3:25 - 3:29
    that changed. The other thing that changed
    is the amount of money deployed in the
  • 3:29 - 3:37
    sector. So, back in 2004, depending on the
    estimate you trust there, the total
  • 3:37 - 3:42
    spending for cybersecurity was between
    three point five to ten billion dollars.
  • 3:42 - 3:49
    Today's over 120 billion dollars. And so
    it kind of looks exponential. But the
  • 3:49 - 3:56
    spending came with a almost... Like, a
    very significant change in the type of
  • 3:56 - 4:01
    players there are in the industry today.
    So a lot of the traditional vendors that
  • 4:01 - 4:05
    used to sell security software have kind
    of disappeared. And what you have today
  • 4:05 - 4:10
    are two kinds of player largely. You have
    the big tech vendors. So you have
  • 4:10 - 4:14
    companies like Google, Amazon, Apple and
    so on, and so forth, that have sort of
  • 4:14 - 4:17
    decided to take security more seriously.
    Some of them are trying to monetize
  • 4:17 - 4:24
    security. Others are trying to use it as a
    sort of like slogan to sell more phones.
  • 4:24 - 4:30
    The other group of people or entities are
    large cloud-based security vendors. And
  • 4:30 - 4:35
    what both groups have in common is that
    they're using more and more sort of like
  • 4:35 - 4:43
    cloud-scale and cloud resources to try to
    tackle security problems. And so what I
  • 4:43 - 4:50
    want to discuss today is from a somewhat
    technical perspective, our scale has made
  • 4:50 - 4:56
    a significant impact in the way we
    approach problems, but also in the kind of
  • 4:56 - 5:01
    people that we have in the industry today.
    So what I'm gonna do is to give you a few
  • 5:01 - 5:08
    examples of the change that we've gone
    through. And one of the, I think one of
  • 5:08 - 5:13
    the important things to keep in mind is
    that what scale has done, at least in the
  • 5:13 - 5:20
    past decade, is it has given defense a
    significant edge over offense. It's not
  • 5:20 - 5:24
    necessarily here to stay, but I think it's
    an important trend that it's somewhat
  • 5:24 - 5:32
    overlooked. So let me start with endpoint
    security. So back in the 80s, a few people
  • 5:32 - 5:38
    started to toy with this idea of IDS
    systems. And the idea behind an IDS system
  • 5:38 - 5:43
    is pretty straightforward. You want to
    create a baseline benign behavior for a
  • 5:43 - 5:48
    machine, and then if that machine starts
    to exhibit anomalous behavior, you would
  • 5:48 - 5:55
    flag that as potentially malicious. This
    was the first paper published on a host
  • 5:55 - 6:00
    based IDS systems. Now, the problem with
    host based IDS systems is that they never
  • 6:00 - 6:07
    actually quite made it as a commercial
    product. And the reason for this... There
  • 6:07 - 6:11
    were largely two reasons for this: The
    first one is that it was really hard to
  • 6:11 - 6:17
    interpret results. So it was really hard
    to figure out: "Hey, here's an anomaly and
  • 6:17 - 6:22
    this is why this anomaly might actually be
    a security incident." The second problem
  • 6:22 - 6:27
    was, you had a lot of false positives and
    it was kind of hard to establish a benign
  • 6:27 - 6:32
    baseline on a single machine, because you
    had a lot of variance on how an individual
  • 6:32 - 6:37
    machine would behave. So what happened is
    that commercially we kind of got stuck
  • 6:37 - 6:44
    with antivirus, antivirus vendors, and
    signatures for a very long time. Now, fast
  • 6:44 - 6:56
    forward to 2013. As I mentioned, the APT1
    report came out and AV companies actually
  • 6:56 - 7:02
    admitted that they weren't that useful at
    detecting stuff like Stuxnet or Flame. And
  • 7:02 - 7:09
    so there was kind of like a new kid on the
    block, and the buzzword name for it was
  • 7:09 - 7:14
    EDR. So, endpoint detection and response.
    But when you strip EDR from like the
  • 7:14 - 7:20
    marketing fluff, what EDR really is, is
    effectively host-based intrusion detection
  • 7:20 - 7:27
    system at scale. So in other words, scale
    and ability to have cloud-scale has made
  • 7:27 - 7:34
    IDS systems possible in two ways. The
    first one is that because you actually now
  • 7:34 - 7:38
    have this sort of like data lake with a
    number of machines, you have much larger
  • 7:38 - 7:44
    datasets to train and test detections on.
    What that means is, it's much easier to
  • 7:44 - 7:50
    establish the benign baseline, and it's
    much easier to create proper detection, so
  • 7:50 - 7:55
    they don't detect just malware, but also
    sort of like malware-less attacks. The
  • 7:55 - 8:01
    other thing is that EDR vendors and also
    companies that have internal EDR systems
  • 8:01 - 8:06
    have -to a large extent- economy of scale.
    And what that means is you can actually
  • 8:06 - 8:11
    have a team of analysts that can create
    explanation and sort of an ontology to
  • 8:11 - 8:18
    explain why a given detection may actually
    represent a security incident. On top of
  • 8:18 - 8:22
    it, because you have those data lake, you
    are now able to mine that for a data to
  • 8:22 - 8:28
    figure out new attack patterns that you
    weren't aware of in the past. So this in
  • 8:28 - 8:32
    itself is a pretty significant
    achievement, because we finally managed to
  • 8:32 - 8:37
    move away from signatures to something
    that works much better and is able to
  • 8:37 - 8:42
    detect a broader range of attacks. But the
    other thing that EDR system solved, sort
  • 8:42 - 8:48
    of like as a side effect, is the data
    sharing problem. So, if you've been around
  • 8:48 - 8:53
    industry for a long time, there have been
    many attempts at sharing threat data
  • 8:53 - 8:58
    across different entities and they all
    kind of failed because it was really hard
  • 8:58 - 9:04
    to establish sort of like a protocol to
    share those data. But implicitly, what EDR
  • 9:04 - 9:13
    has done, is to force people to share and
    collect threat intelligence data and just
  • 9:13 - 9:19
    in general data from endpoints. And so now
    you have the vendors being the sort of
  • 9:19 - 9:24
    implicitly trusted third party that can
    use that data to write detections that can
  • 9:24 - 9:30
    be applied to all the systems, not just an
    individual company or any individual
  • 9:30 - 9:36
    machine. And the result of that, the
    implication of that is that the meme that
  • 9:36 - 9:40
    the attacker only needs to get it right
    once and the defender needs to get it
  • 9:40 - 9:44
    right all the time is actually not that
    true anymore, because in the past you were
  • 9:44 - 9:49
    in a situation where if you had an
    offensive infrastructure, whether it was
  • 9:49 - 9:54
    servers, whether it was exploit chains,
    you could more often than not reuse them
  • 9:54 - 9:59
    over and over again. Even if you had
    malware, all you had to do was to slightly
  • 9:59 - 10:05
    mutate the sample and you would pass any
    kind of detection. But today that is not
  • 10:05 - 10:11
    true anymore in most cases. If you get
    detected on one machine, all of the
  • 10:11 - 10:15
    sudden, all of your offensive
    infrastructure has to be scrapped and you
  • 10:15 - 10:20
    need to start from scratch. So this is the
    first example and I think in itself is
  • 10:20 - 10:27
    quite significant. The second example that
    I want to talk about is fuzzing. And
  • 10:27 - 10:31
    fuzzing is interesting also for another
    reason, which is it gives us a glimpse
  • 10:31 - 10:36
    into what I think the future might look
    like. So as you're probably familiar, if
  • 10:36 - 10:40
    you've done any apps like work in the
    past, Fuzzing has been sort of like a
  • 10:40 - 10:47
    staple in the apps like arsenal for a very
    long time. But in the past, probably five
  • 10:47 - 10:52
    years or so, fuzzing has gone through some
    kind of renaissance in the sense that two
  • 10:52 - 10:58
    things have changed. Two things have
    improved massively. The first one is that
  • 10:58 - 11:05
    we finally managed to find a better way to
    assess the fitness function that we use to
  • 11:05 - 11:10
    guide fuzzing. So a few years ago,
    somebody called Michal Zalewski release a
  • 11:10 - 11:18
    fuzzer called AFL, and one of the primary
    intuitions behind AFL was that you could
  • 11:18 - 11:23
    actually instead of using code coverage to
    drive the fuzzer, you could use path
  • 11:23 - 11:29
    coverage to drive the fuzzer and that
    turned fuzzing in a way more, you know,
  • 11:29 - 11:34
    much more effective instrument to find
    bugs. But the second intuition that I
  • 11:34 - 11:40
    think is even more important and that
    changed fuzzing significantly is the fact
  • 11:40 - 11:46
    that as far as fuzzing is concerned, speed
    is more important than smarts. You know,
  • 11:46 - 11:53
    in a way. And what I mean by this is that
    when you look at AFL, AFL as an example,
  • 11:53 - 11:58
    is an extremely dumb fuzzer. It does stuff
    like byte flipping, bit flipping. It has
  • 11:58 - 12:04
    very, very simple strategies for mutation.
    But what AFL does very well is, it's an
  • 12:04 - 12:10
    extremely optimized piece of C code and it
    scales very well. And so you are in a
  • 12:10 - 12:15
    situation where if you have a reasonably
    good server, where you can run AFL, you
  • 12:15 - 12:23
    can synthesize a very complex file formats
    in very few iterations. And what I find
  • 12:23 - 12:28
    that amazing is that this intuition
    doesn't apply just to file formats. This
  • 12:28 - 12:32
    intuition applies to much more complicated
    state machines. So the other example that
  • 12:32 - 12:38
    I want to talk about as far as fuzzing
    goes, is ClusterFuzz. ClusterFuzz is a
  • 12:38 - 12:46
    fuzzing harness used by the Chrome team to
    find bugs in Chrome and ClusterFuzz has
  • 12:46 - 12:51
    been around for about six years. In the
    span of six years ClusterFuzz found
  • 12:51 - 12:55
    sixteen thousand bugs in Chrome alone,
    plus another eleven thousand bugs in a
  • 12:55 - 13:00
    bunch of open source projects. If you
    compare ClusterFuzz with the second most
  • 13:00 - 13:07
    successful fuzzer are out there for
    JavaScript engines, you'll find that the
  • 13:07 - 13:13
    second fuzzer called jsfunfuzz found about
    six thousand bugs in the span of eight to
  • 13:13 - 13:19
    nine years. And if you look at the code,
    the main difference between the two is not
  • 13:19 - 13:23
    the mutation engine. The mutation engine
    is actually pretty similar. They don't...
  • 13:23 - 13:26
    ClusterFuzz doesn't do anything
    particularly fancy, but what ClusterFuzz
  • 13:26 - 13:33
    does very well is it scales massively. So
    ClusterFuzz today runs on about twenty
  • 13:33 - 13:39
    five thousand cores. And so with fuzzing
    we're now at a stage where the bug churn
  • 13:39 - 13:45
    is so high that defense again has an
    advantage compared to offense because it
  • 13:45 - 13:51
    becomes much quicker to fix bugs than it
    becomes to fix exploit chains, which would
  • 13:51 - 13:57
    have been unthinkable just a few years
    ago. The last example that I want to bring
  • 13:57 - 14:04
    up is a slightly different one. So, a few
    months ago, the TAG team at Google found
  • 14:04 - 14:12
    in the wild a server that was used for a
    watering hole attack, and it was thought
  • 14:12 - 14:17
    that the server was used against Chinese
    Muslim dissidents. But what's interesting
  • 14:17 - 14:21
    is that the way you would detect this kind
    of attack in the past was that you would
  • 14:21 - 14:27
    have a compromised device and you would
    sort of like work backwards from there.
  • 14:27 - 14:32
    You would try to figure out how the device
    got compromised. What's interesting is
  • 14:32 - 14:36
    that the way they found the server was
    effectively to mine their local copy of
  • 14:36 - 14:43
    the Internet. And so, again, this is
    another example of scale that gives them a
  • 14:43 - 14:50
    significant advantage to defense versus
    offense. So, in all of these examples
  • 14:50 - 14:56
    that I brought up, I think when you look
    deeper into them, you realise that it's
  • 14:56 - 15:00
    not that the state of security has
    improved because we've necessarily got
  • 15:00 - 15:06
    better at security. It's that it has
    improved because we got better at handling
  • 15:06 - 15:10
    large amounts of data, storing large
    amounts of data and spawning computing
  • 15:10 - 15:19
    power and resources quickly when needed.
    So, if that is true, then one of... the
  • 15:19 - 15:23
    other thing to realise is that in many of
    these cases, when you look back at the
  • 15:23 - 15:29
    examples that I brought up, it actually is
    the case that the problem at scale looks
  • 15:29 - 15:34
    very different from the problem at a much
    smaller scale, and the solution as a
  • 15:34 - 15:39
    result is very different. So I'm going to
    use a silly example to try to drive the
  • 15:39 - 15:45
    point home. Let's say that your job is to
    audit this function. And so you need to
  • 15:45 - 15:49
    find bugs and this function. In case
    you're not familiar with C code, the
  • 15:49 - 15:58
    problem here is that you can overflow or
    underflow that buffer at your pleasure
  • 15:58 - 16:05
    just by passing a random value for "pos".
    Now, if you were to manually audit this
  • 16:05 - 16:13
    thing, or if your job was to audit
    this function, well, you could use... You
  • 16:13 - 16:18
    would have many tools you could use. You
    could do manual code auditing. You could
  • 16:18 - 16:22
    use a symbolic execution engine. You could
    use a fuzzer. You could use static
  • 16:22 - 16:28
    analysis. And a lot of the solutions that
    are optimal for this case end up being
  • 16:28 - 16:32
    completely useless, if now your task
    becomes to audit this function and this is
  • 16:32 - 16:39
    because the state machine that this
    function implements is so complex that a
  • 16:39 - 16:45
    lot of those tools don't scale to get
    here. Now, for a lot of the problems I've
  • 16:45 - 16:51
    talked about it, we kind of face the same
    situation where the solution at scale and
  • 16:51 - 16:58
    a problem of scale looks very different.
    And so one thing, one realization is that
  • 16:58 - 17:03
    engineering skills today are actually more
    important than security skills in many
  • 17:03 - 17:09
    ways. So when you look... when you think
    back at fuzzers like ClusterFuzz, or AFL,
  • 17:09 - 17:15
    or again EDR tools, what matters there is
    not really any kind of security expertise.
  • 17:15 - 17:20
    What matters there is the ability to
    design systems that scale arbitrarily
  • 17:20 - 17:26
    well, in sort of like their backend, to
    design, to write code that is very
  • 17:26 - 17:33
    performant and none of this has really
    much to do with traditional security
  • 17:33 - 17:39
    skills. The other thing you realize is
    when you combine these two things is that
  • 17:39 - 17:47
    a lot of what we consider research is
    happening in a different world to some
  • 17:47 - 17:52
    extent. So, six years ago, about six years
    ago, I gave a talk at a conference called
  • 17:52 - 17:57
    CCS and it's an academic conference. And
    basically what I... my message there was
  • 17:57 - 18:02
    that if academia wanted to do research
    that was relevant to the industry, they
  • 18:02 - 18:07
    had to talk to the industry more. And I
    think we are now reached the point where
  • 18:07 - 18:13
    this is true for industry in the sense
    that if we want to still produce
  • 18:13 - 18:20
    significant research at places like CCC,
    we are kind of in a bad spot because a lot
  • 18:20 - 18:26
    of the innovation that is practical in the
    real world is happening very large... in
  • 18:26 - 18:31
    very large environments that few of us
    have access to. And I'm going to talk a
  • 18:31 - 18:35
    bit more about this in a second. But
    before I do, there is a question that I
  • 18:35 - 18:42
    think is important to digress on a bit.
    And this is the question of:
  • 18:42 - 18:46
    Have we changed
    significantly as an industry, are we are
  • 18:46 - 18:53
    in sort of like a new age of the industry?
    And I think that if you were to split the
  • 18:53 - 18:59
    industry in phases, we left the kind of
    like artisanal phase, the phase where what
  • 18:59 - 19:04
    mattered the most was security knowledge.
    And we're now in a phase where we have
  • 19:04 - 19:09
    this large scale expert systems that
    require significant more
  • 19:09 - 19:14
    engineering skills, that they require
    security skills, but they still take input
  • 19:14 - 19:19
    from kind of like security practitioners.
    And I think there is a question of: Is
  • 19:19 - 19:24
    this it? Or is this the kind of like where
    the industry is going to stay, or is there
  • 19:24 - 19:31
    more to come? I know better than to make
    predictions in security, 'cause most of
  • 19:31 - 19:36
    the times they tend to be wrong, but I
    want to draw a parallel. And that parallel
  • 19:36 - 19:42
    is with another industry, and it's Machine
    Learning. So, somebody called Rich Sutton
  • 19:42 - 19:46
    who is one of the godfather of machine
    learning, wrote an essay called "The
  • 19:46 - 19:52
    Bitter Truth". And in that essay, he
    reflects on many decades of machine
  • 19:52 - 19:58
    learning work and what he says in the
    essay is that people tried for a very long
  • 19:58 - 20:02
    time to embed knowledge in machine
    learning systems. The rationale was that
  • 20:02 - 20:06
    if you could embed knowledge, you would
    have a smart... you could build smarter
  • 20:06 - 20:12
    systems. But it turns out that what
    actually worked were things that scale
  • 20:12 - 20:18
    arbitrarily well with more computational
    power, more storage capabilities. And so,
  • 20:18 - 20:23
    what he realized was that what actually
    worked for machine learning was search and
  • 20:23 - 20:30
    learning. And when you look at stuff like
    AlphaGo today, AlphaGo works not really
  • 20:30 - 20:36
    because it has a lot of goal knowledge. It
    works because it has a lot of computing
  • 20:36 - 20:44
    power. It has the ability to train itself
    faster and faster. And so there is a
  • 20:44 - 20:49
    question of how much of this can
    potentially port to security. Obviously,
  • 20:49 - 20:53
    security is a bit different, it's more
    adversarial in nature, so it's not quite
  • 20:53 - 20:58
    the same thing. But I think we are... we
    have only scratched the surface of what
  • 20:58 - 21:05
    can be done as far as reaching a newer
    level of automation where security
  • 21:05 - 21:10
    knowledge will matter less and less. So I
    want to go back to the AFL example that I
  • 21:10 - 21:16
    brought up earlier, because one way to
    think about AFL is to think about it as a
  • 21:16 - 21:23
    reinforcement learning fuzzer. And what I
    mean by this... is in this slide, what AFL
  • 21:23 - 21:30
    capable to do, was to take one single JPEG
    file and in the span of about twelve
  • 21:30 - 21:36
    hundred days iteration, they were
    completely random dumb mutation. Go to
  • 21:36 - 21:41
    another well-formed JPEG file. And when
    you think about it, this is an amazing
  • 21:41 - 21:48
    achievement because there was no knowledge
    of the file format in AFL. And so we are
  • 21:48 - 21:53
    in... we are now more and more building
    systems that do not require any kind of
  • 21:53 - 21:57
    expert knowledge as far as security is
    concerned. The other example that I want
  • 21:57 - 22:02
    to talk about is the Cyber Grand
    Challenge. So DARPA ... a few years ago
  • 22:02 - 22:05
    started this competition called Cyber
    Grand Challenge,
  • 22:05 - 22:10
    and the Idea behind cyber grand challenge
    was to try to answer the question of can
  • 22:10 - 22:14
    you automagically do exploit generation
    and can you automatically do patch
  • 22:14 - 22:21
    generation. And obviously they did it on
    some well toy environments. But if you
  • 22:21 - 22:25
    talk today to anybody who does automatic
    export generation research, they'll tell
  • 22:25 - 22:31
    you that we are probably five years away
    from being able to automatically
  • 22:31 - 22:36
    synthesize non trivial exploits, which is
    an amazing achievement because if you
  • 22:36 - 22:41
    asked anybody five years ago, most people,
    myself included, would tell you that
  • 22:41 - 22:46
    time would not come anytime soon. The
    third example that I want to bring up is
  • 22:46 - 22:51
    something called Amazon Macie, which is a
    new sort of service released by Amazon.
  • 22:51 - 22:56
    And what it does is basically uses machine
    learning to try to automatically identify
  • 22:56 - 23:02
    PII information and intellectual property
    in the data. You started with a AWS and
  • 23:02 - 23:08
    then tried to give you a better sense of
    what happens to that data. So in all of
  • 23:08 - 23:12
    these cases, when you think about them,
    again, it's a scenario where there is very
  • 23:12 - 23:21
    little security expertise needed. What
    matters more is engineering skills. So
  • 23:21 - 23:28
    everything I've said so far is reasonably
    positive for scale. Is a positive scale,
  • 23:28 - 23:34
    it is a positive, sort of like case for
    scale. But I think that there is another
  • 23:34 - 23:42
    side of scale that is worth touching on.
    And I think especially to this audience is
  • 23:42 - 23:49
    important to think about. And the other
    side of scale is that scale breeds
  • 23:49 - 23:55
    centralization. And so to the point I was
    making earlier about where, where is
  • 23:55 - 24:00
    research happening, where is real word
    applicable research happening, and that
  • 24:00 - 24:08
    happens increasingly in places like Amazon
    or Google or large security vendors or
  • 24:08 - 24:14
    some intelligence agencies. And so what
    that means is the field, the barriers to
  • 24:14 - 24:20
    entry to the field are are significantly
    higher. So I said earlier that I tried to
  • 24:20 - 24:25
    join the industry about 15 years ago. Back
    then, I was still in high school. And one
  • 24:25 - 24:29
    of the things that was cool about the
    industry for me was that as long as you
  • 24:29 - 24:34
    had a reasonably decent internet
    connection and a laptop, you could
  • 24:34 - 24:40
    contribute to the top of the industry. You
    could see what everyone was up to. You
  • 24:40 - 24:44
    could do research that was relevant to
    what the what the industry was working on.
  • 24:44 - 24:49
    But today, the same sort of like 15, 16
    year old kid in high school would have a
  • 24:49 - 24:55
    much harder time contributing to the
    industry. And so we are in a situation
  • 24:55 - 25:01
    where... but because scale breeds
    centralization. We are in a situation
  • 25:01 - 25:06
    where we will likely increase the barrier
    of entry to a point where if you want to
  • 25:06 - 25:11
    contribute meaningfully to security, you
    will have to go through a very
  • 25:11 - 25:16
    standardized path where you probably do
    computer science and then you go work for
  • 25:16 - 25:26
    a big tech company. And that's not
    necessarily a positive. So I think the
  • 25:26 - 25:31
    same Kranzberg principle applies to scale
    in a sense, where it has done a lot of
  • 25:31 - 25:38
    positive things for the sector, but it
    also comes with some consequences. And if
  • 25:38 - 25:45
    there is one takeaway from this talk
    that I would like you to have is to think
  • 25:45 - 25:52
    about how much something that is pretty
    mundane that we take for granted in our
  • 25:52 - 25:57
    day to day has changed the industry and
    how much that will probably contribute to
  • 25:57 - 26:00
    the next phase of the industry. And not
    just from a technical standpoint. It's not
  • 26:00 - 26:04
    that the solutions we use today are
    much different from what we used to use,
  • 26:04 - 26:08
    but also from the kind of people that are
    part of the industry and the community.
  • 26:08 - 26:13
    And that's all I had. Thank you for
    listening.
  • 26:13 - 26:24
    Applause
  • 26:24 - 26:28
    Herald: Thank you very much. We have time
    for questions. So if you have any
  • 26:28 - 26:32
    questions for Vincenzo, please line up
    behind the microphones that are marked
  • 26:32 - 26:37
    with numbers and I will give you a signal
    if you can ask a question. We also have
  • 26:37 - 26:41
    our wonderful signal angels that have been
    keeping an eye on the Internet to see if
  • 26:41 - 26:47
    there are any questions from either
    Twitter, Mastodon or IRC. Are there any
  • 26:47 - 26:53
    questions from the Internet? We'll just
    have to mic fourth... microphone number
  • 26:53 - 26:58
    nine to be turned on and then we'll have a
    question from the Internet for Vincenzo.
  • 26:58 - 27:01
    And please don't be shy. Line up behind
    the microphone. Ask any questions.
  • 27:01 - 27:05
    Signal Angel: Now it's on. But actually
    there are no questions from the Internet
  • 27:05 - 27:09
    right now.
    Herald: There must be people in the room
  • 27:09 - 27:15
    that have some questions. I cannot see
    anybody lining up. Do you have any advice
  • 27:15 - 27:19
    for people that want to work on some
    security on scale?
  • 27:19 - 27:24
    Vincenzo: I mean, I just had to think a
    lot of the interesting research is
  • 27:24 - 27:28
    happening more and more like tech
    companies and similar. And so as much as
  • 27:28 - 27:34
    it pains me. It's probably the advice to
    think either whether you can find other
  • 27:34 - 27:40
    ways to get access to large amounts of
    data or and computational power or maybe
  • 27:40 - 27:45
    consideresting into one of those places.
    Herald: And we now actually have questions
  • 27:45 - 27:51
    at microphone number one.
    Microphone 1: Can you hear me? Yeah. Thank
  • 27:51 - 27:55
    you for the great talk. You're making a
    very strong case that information at scale
  • 27:55 - 28:00
    has benefited security, but is that also
    statistical evidence for that?
  • 28:00 - 28:05
    Vincenzo: So I think, well, it's a bit
    hard to answer the question because a lot
  • 28:05 - 28:10
    of the people that have an incentive to
    answer that question are also kind of
  • 28:10 - 28:17
    biased, but I think when you look at
    metrics like well, time in terms of how
  • 28:17 - 28:22
    much time people spend on attackers
    machine, that has decreased significantly
  • 28:22 - 28:29
    like it, it has statistically decreased
    significantly. As far as the other
  • 28:29 - 28:33
    examples I brought up, like fuzzing and
    similar. I don't think I as far as I'm
  • 28:33 - 28:43
    aware, there hasn't been any sort of
    rigorous study around where now we are.
  • 28:43 - 28:51
    We've reached the place where defense has
    kind of like an edge against offense. But
  • 28:51 - 28:57
    I think if I talk to anybody who has kind
    of like some offensive security knowledge
  • 28:57 - 29:05
    or they did work in offense, the overall
    feedback that I hear is that it's becoming
  • 29:05 - 29:12
    much harder to keep bug chains alive for a
    long time. And this is in large part not
  • 29:12 - 29:18
    really for for countermeasures. It's in
    large part because bugs keep churning.
  • 29:18 - 29:23
    So there isn't a lot of
    statistical evidence, but from what I can
  • 29:23 - 29:29
    gather, it seems to be the case.
    Herald: We have one more question from
  • 29:29 - 29:32
    microphone number one.
    Microphone 1: So thank you for the
  • 29:32 - 29:36
    interesting talk. My question goes in the
    direction of the centralization that you
  • 29:36 - 29:40
    mentioned, that the large like the
    hyperscalers are converging to be the
  • 29:40 - 29:44
    hotspots for security research. So is
    there any guidance you can give for us as
  • 29:44 - 29:50
    a community how to to retain access to the
    field and contribute?
  • 29:50 - 29:53
    Vincenzo: Yes. So. So I think
    it's an interesting situation
  • 29:53 - 29:57
    because more and more there
    are open source tools that
  • 29:57 - 30:02
    allow you to gather the data. But the
    problem with these data gathering
  • 30:02 - 30:06
    exercises is not too much how to gather
    the data. The problem is what to gather
  • 30:06 - 30:12
    and how to keep it. Because when you look
    at the cloud bill, for most
  • 30:12 - 30:17
    players, it's extraordinarily high.
    And I don't unfortunately, I don't have an
  • 30:17 - 30:23
    easy solution to that. I mean, you can use
    pretty cheap cloud providers, but
  • 30:23 - 30:29
    it's still like, the expenditure is still
    an order of magnitude higher than it used
  • 30:29 - 30:35
    to be. And I don't know, maybe academia
    can step up. I'm not sure.
  • 30:35 - 30:39
    Herald: We have one last question from the
    Internet. And you can stay at the
  • 30:39 - 30:41
    microphone if you have another question
    for Vincenzo.
  • 30:41 - 30:46
    Signal: Yes. The Internet asked that. You
    ask a lot about fuzzing at scale about
  • 30:46 - 30:51
    besides OSS-Fuzz, are you aware of any
    other scaled large fuzzing infrastructure?
  • 30:51 - 30:58
    Vincenzo: That is publicly available? No.
    But when you look at, I mean when you
  • 30:58 - 31:04
    look, for instance, of the participants
    for Cyber Grand Challenge, a lot of them
  • 31:04 - 31:13
    were effectively using a significant
    amount of CPU power for fuzzing. So I'm
  • 31:13 - 31:17
    not aware of any kind of like plug and
    play fuzzing infrastructure you can use
  • 31:17 - 31:26
    aside from OSS-Fuzz. But there is a law,
    like as far as I'm aware, everyone there
  • 31:26 - 31:33
    that does fuzzing for a living has now
    access to significant resources and tries
  • 31:33 - 31:39
    to scale fuzzing infrastructure.
    Herald: If we don't have any more
  • 31:39 - 31:43
    questions, this is your last chance to run
    to a microphone or write a question on the
  • 31:43 - 31:47
    Internet. Then I think we should give a
    big round of applause to Vincenzo.
  • 31:47 - 31:48
    Vincenzo: Thank you.
  • 31:48 - 31:53
    Applause
  • 31:53 - 32:19
    subtitles created by c3subtitles.de
    in the year 2019. Join, and help us!
Title:
36C3 - The Case for Scale in Cyber Security
Description:

more » « less
Video Language:
English
Duration:
32:19

English subtitles

Revisions