Return to Video

1.3. Basic optimization principles

  • 0:04 - 0:12
    We begin our study of the basic principals
    of optimization by looking at a very
  • 0:12 - 0:20
    general form of optimization problem which
    is given by this. Maximize over a variable
  • 0:20 - 0:27
    x. An objective function t of x. To
    provide some motivation think of the
  • 0:27 - 0:35
    problem as a one face by a firm that once
    maximize total profits given by t of x. So
  • 0:35 - 0:43
    these are total profits. That's a function
    of an action that in can take x, which x
  • 0:43 - 0:50
    is any number. A little bit of terminology
    in a general optimization problem the
  • 0:50 - 1:00
    variable that we can control is called the
    control variable. And this is a variable
  • 1:00 - 1:06
    that which is over and the function that
    we are trying to maximize in this case T
  • 1:06 - 1:13
    of X is called the objective function.
    Using this terminology the goal of the
  • 1:13 - 1:19
    optimum maximization problem is to select
    the level of the control variable that
  • 1:19 - 1:25
    generates the maximum valuable of the
    objective function. To provide some
  • 1:25 - 1:34
    intuition consider a graphical
    presentation of the problem. We have some
  • 1:34 - 1:41
    sort of function, let's say something that
    looks like this, is the function t of x.
  • 1:41 - 1:48
    And what we are trying to do is find the
    level of x in this axis that provides the
  • 1:48 - 1:55
    maximum level of a function. In this
    graphical representation this would be
  • 1:55 - 2:05
    this level. Fortunately there is a recipe
    that you learn in calculus that allow us
  • 2:05 - 2:11
    to take a function t apply some
    derivatives and find the general body of x
  • 2:11 - 2:18
    star that maximizes the function. And the
    recipe is given by what are known as the
  • 2:18 - 2:24
    first order. Or necessary conditions and
    second order or sufficient conditions. The
  • 2:24 - 2:31
    first order conditions state that at any
    point that is a local maximum must satisfy
  • 2:31 - 2:38
    the condition that each derivative is
    equal to zero. However as you probably
  • 2:38 - 2:44
    remember for calculus necessary conditions
    are not sufficient some points may satisfy
  • 2:44 - 2:50
    them and it's still not the maximum and we
    will see an example of this in a second.
  • 2:50 - 2:56
    However, if a point also satisfies what I
    call second order of sufficient conditions
  • 2:56 - 3:02
    which in this case is given by second
    derivative is less than zero so then such
  • 3:02 - 3:10
    a point is a local max imum.
    To warm up a practice one which we have
  • 3:10 - 3:17
    just learned lets do a simple example.
    Consider the function ten minus x minus
  • 3:17 - 3:24
    five squared. Think of this as a profit
    function that we are trying to maximize.
  • 3:24 - 3:30
    In order to find the maximum we need to
    first apply the first order conditions
  • 3:30 - 3:37
    which would say that the derivative of the
    objective function which in this case is
  • 3:37 - 3:46
    minus two times x minus five. Has to be
    equal to zero and doing a little bit of
  • 3:46 - 3:57
    algebra this implies that x and the
    optimum. Has to be equal to five. Now we
  • 3:57 - 4:01
    know that these conditions are necessary
    but not sufficient so lets check the
  • 4:01 - 4:06
    second order conditions which are the
    sufficient ones that requires taking the
  • 4:06 - 4:11
    second derivative of the function or the
    derivative of this which in this case
  • 4:11 - 4:17
    means. The second derivative of this
    function is just minus two which is less
  • 4:17 - 4:23
    than zero which implies that the second
    order conditions are satisfied and that
  • 4:23 - 4:29
    the unique local optimum or local maximum
    therefore the global maximum is at x star
  • 4:29 - 4:35
    equals five. Let me provide you with three
    different types of intuition for why the
  • 4:35 - 4:39
    first and second order conditions look
    like they do. Let's just start with a
  • 4:39 - 4:44
    graphical intuition of the first order
    conditions. Suppose that you have a
  • 4:44 - 4:52
    function that looks more or less like
    this. That is the objective function that
  • 4:52 - 4:58
    we're trying to maximize. Remember, that
    the first order condition states that at
  • 4:58 - 5:05
    any local maximum, the derivative at the
    optimum, at the at local maximum has to be
  • 5:05 - 5:11
    equal to zero. Now, to see why this has to
    be the case, consider a point for example
  • 5:11 - 5:16
    like at x, at which the derivative.
    Clearly, it's not equal to zero. In this
  • 5:16 - 5:22
    particular case it's positive. It is easy
    to see that the value of the function of
  • 5:22 - 5:28
    that point can not be a maximum. In other
    words x can not be maximizing the function
  • 5:28 - 5:34
    because given that this lobby is positive
    if you move a little bit to the right,
  • 5:34 - 5:40
    let's say to a point like x plus delta x
    you can increase the value of the function
  • 5:40 - 5:46
    approximately by the value this low times
    delta x. Now, it's also easy to see, that
  • 5:46 - 5:52
    if you're at a point, like X-hat, at
    which, in this case, the value of the
  • 5:52 - 5:58
    slope is negative, you can also be, novy
    maximizing, because if you move to a point
  • 5:58 - 6:05
    like X-hat, minus D-X, minus delta X, you
    can also increase the value of the
  • 6:05 - 6:10
    function for similar resource. Now, it's
    very easy to see, that the only point at
  • 6:10 - 6:16
    which you can not play this trick, of
    improving the value of the objective
  • 6:16 - 6:21
    function, by changing the value X, is a
    point, like, X-star, at which, the slope,
  • 6:21 - 6:26
    is zero, because if you move locally,
    either to the left, or to the right, the
  • 6:26 - 6:32
    value of the objective function would not
    change. This provides an intuition, for
  • 6:32 - 6:37
    why, the first order conditions, have to
    satisfy, this property at a global, at a
  • 6:37 - 6:44
    local maximum. Now consider a graphical
    intuition for why the second order
  • 6:44 - 6:50
    conditions look like they do. To remind
    you, the second order condition is given
  • 6:50 - 6:56
    by, the second derivative and the local
    optimum has to be less than zero. That
  • 6:56 - 7:01
    provides together with the first other
    conditions, sufficient conditions for
  • 7:01 - 7:07
    optimization. Now, to see why that has to
    be the case, compare the following two
  • 7:07 - 7:13
    functions. The one on the left looks like
    this, and it does satisfy the, Second
  • 7:13 - 7:22
    order conditions, are the point X star. In
    fact, it is easy to see that in this case,
  • 7:22 - 7:29
    the second derivative, at X star is
    greater than zero. This low is becoming
  • 7:29 - 7:38
    progressive more positive as you move to
    the right. Just like the one before and at
  • 7:38 - 7:44
    which an x star our candidate for the
    local optimum the second derivative
  • 7:44 - 7:52
    satisfies the second order condition let's
    see why it has to be the case that x star
  • 7:52 - 7:59
    is not a maximum here but is a maximum
    here and the key. Nature of the difference
  • 7:59 - 8:04
    is captured by the second order condition.
    Now, if you start x star in the field on
  • 8:04 - 8:09
    the left and move a tiny bit to the right
    by delta X, since the slope is zero you
  • 8:09 - 8:14
    don't really change the value of the
    function, so that doesn't help you,
  • 8:14 - 8:19
    however and this is the key intuition that
    you have to see, because the second
  • 8:19 - 8:24
    derivative is positive the value of t he
    log changes when you move from Xstar to
  • 8:24 - 8:30
    Xstar + delta X. And becomes positive here
    at the new point. That means that if you
  • 8:30 - 8:35
    carry out a further move, let's say by
    another delta x since this slope was
  • 8:35 - 8:41
    positive at x star plus delta x, the
    function now has increased, and that means
  • 8:41 - 8:47
    that by moving two delta x to the right,
    you can improve the value of the function
  • 8:47 - 8:52
    which means that it cannot be an optimum.
    In contrast see what happens if you tried
  • 8:52 - 8:58
    to play the same game at x star in the
    field on the right. As before the first
  • 8:58 - 9:04
    order conditions are satisfied so eh this
    lobby zero at x star but if you tried to
  • 9:04 - 9:10
    move to the right because lets say by
    delta x because the second derivative is
  • 9:10 - 9:16
    less then zero this lobe actually goes
    down. Unit becomes negative therefore if
  • 9:16 - 9:23
    you try to move a farther the x the value
    of the function goes down. It should be
  • 9:23 - 9:28
    easy to see and you should convince
    yourself that it doesn't matter for this
  • 9:28 - 9:34
    argument whether the movement is to the
    right or to the left. In both cases they
  • 9:34 - 9:40
    go through in an analogous way. This
    demonstrates the intuition for why the
  • 9:40 - 9:46
    second order condition t double prime of x
    star less than zero provide sufficient
  • 9:46 - 9:53
    conditions for the point to be a local
    maximum. Let's look at intuition for why
  • 9:53 - 9:59
    the optimization conditions look like they
    do from a different perspective in this
  • 9:59 - 10:05
    case a mathematical one. By the way I'm
    trying different intuitions because some
  • 10:05 - 10:11
    of you may find some of them more natural
    then the others. Now you will recall from
  • 10:11 - 10:17
    basic calculus that the value of the
    objective function in fact of any function
  • 10:17 - 10:23
    at the point x plus dx can be very well
    approximated by the value of the function
  • 10:23 - 10:30
    at x. Class the derivative of the
    functional x times delta x class the
  • 10:30 - 10:37
    second derivative of the function at x
    times the x square plus a bunch of high
  • 10:37 - 10:46
    order terms that are approximate zero for
    a small changes in the value of x. Now,
  • 10:46 - 10:52
    given this, the change in the fun in the
    objective function produced by the trans
  • 10:52 - 10:59
    delta x can be written as DT. The change
    in the function is approximately equal to
  • 11:00 - 11:10
    the derivative at x times delta x, plus
    the second derivative at x times the x
  • 11:10 - 11:18
    squared. Now for a function to be
    maximized at X, it has to be the case that
  • 11:18 - 11:23
    we cannot find a small change delta X,
    either positive or negative, that
  • 11:23 - 11:28
    increased the value of the function, in
    other words that makes the change in the
  • 11:28 - 11:34
    function positive. But notice that if the
    point X satisfies the first order
  • 11:34 - 11:39
    conditions, the necessary conditions, this
    term is zero, sees the first derivative is
  • 11:39 - 11:44
    equal to zero. Furthermore, if it
    satisfies the second order conditions,
  • 11:44 - 11:51
    this term is negative, and therefore, the
    doesn't change the function has to be less
  • 11:51 - 11:55
    than zero. In other words, when the first
    order conditions and second order
  • 11:55 - 11:59
    condition are satisfied, it is not
    possible to find a change, a small lock
  • 11:59 - 12:08
    and chance, delta x, that increases the
    value of the function. Finally, consider
  • 12:08 - 12:13
    the economic intuition behind the
    maximization problem. Recall the
  • 12:13 - 12:21
    reputation of the representative function,
    T of X, as some sort of benefit of taking
  • 12:21 - 12:29
    action x measuring dollars. For example if
    you would affirm t of x made the note the
  • 12:29 - 12:36
    profit of taking action x. Now under this
    interpretation of the problem the
  • 12:36 - 12:48
    derivative of the function x is called the
    marginal benefit, which is a concept that
  • 12:48 - 12:55
    we will use repeatedly throughout the
    course and that I will abbreviate with the
  • 12:55 - 13:01
    acronym MB. Now, the marginal benefit of x
    is equal to either the increase in benefit
  • 13:02 - 13:15
    produced by increasing X by one unit, or
    alternative, they decrease in benefit
  • 13:15 - 13:25
    produced by decreasing X by one unit. Now
    an important property of an optimum has to
  • 13:25 - 13:31
    be that at the optimum or maximum, the
    marginal benefit has to be equal to zero.
  • 13:31 - 13:38
    Now, why is this the case? Well, suppose
    that is not the case? For example, suppose
  • 13:38 - 13:45
    that the marginal benefit is greater than
    zero then clearly profits or benefits were
  • 13:45 - 13:52
    not maximized at x star since you could
    increase them by increasing x by a little
  • 13:52 - 14:02
    bit. So if you increase x, you increase
    profits, or the benefit function t.
  • 14:02 - 14:08
    Similarly, if the marginal benefit is l
    ess then zero, you can not be maximizing
  • 14:08 - 14:15
    the benefit either, because in this case
    by decreasing x you can also increase
  • 14:15 - 14:23
    benefits. It follows that at an optimum
    marginal benefit has to be equal to zero.
  • 14:23 - 14:29
    Now the last principle that we have seen
    in the economic intuition is so important
  • 14:29 - 14:33
    that is water everything. An economic
    actor that is performing the optimal
  • 14:33 - 14:38
    action would choose its level of activity
    to the point where marginal benefit is
  • 14:38 - 14:43
    equal to zero. Marginal benefit equal to
    zero ingrain that in your heads because
  • 14:43 - 14:48
    otherwise it cannot be optimizing if
    marginal benefit was not equal to zero it
  • 14:48 - 14:54
    could increase its benefit or its profits
    by either increasing its level of activity
  • 14:54 - 14:56
    or decreasing its level of activity.
Title:
1.3. Basic optimization principles
Video Language:
English

English subtitles

Revisions