Ledelse og Erhvervsøkonomi/Handelsvidenskabeligt Tidsskrift/Erhvervsøkonomisk Tidsskrift, Bind 27 (1963)On the Development of Utility Spaces for Multi-Goal Systems.,As operations research becomes a moire highly developed science and attempts to tackle significant problems of large, multi-goal systems, theory is required which can provide a framework for the optimization of such systems. This article attempts to contribute to such theory. In particular., the article discusses the development of utility functions which express the preferences of those responsible for the system, when these preferences or goals are expressed in several units of measure. A series of theorems is presented which state the necessary and sufficient conditions for the utility functions to have certain desirable forms. The article closes with a discussion of the practical application of the theory developed. By Peter Mark Pruzan *) and J. T. Ross Jackson **) Introduction.The operations researcher is often confronted with the problem of recommending a course of action which will optimize a utility function expressing the preferences of the decision-makers responsible for a system. Optimizing with respect to a system thus compels the operations researcher to consider the objectives associated, with components of the system, objectives which may be in apparent conflict with each other, and which may be expressed in different units of measure. Furthermore, even the sub-systems may be characterized by multi-goal objectives, and thus by measures of performance which are expressed *) Dr. Pruzan, B. S. E. - Princeton University, M.B.A. - Harvard University, Ph. D. in Operations Research - Case Institute of Technology, is employed by IBM A fS, Copenhagen. **) Mr. Jackson, B. S. E. - Queens College, Canada, M. B. A. - Purdue University, is a member of the Operations Research Group, Case Institute of Technology. Side 258
in different units. A decision-maker may specify, for example, that he is motivated by a profit incentive, but that he also pays much attention to certain other variables, such as customer goodwill, market share, sales level, time, personal prestige among his peers, and very likely, many more. The symbolic representation of the utility function for the system is a fundemental task of the operations researcher. Sub-system optimization based upon utility functions of a single variable (e. g. profit) has been widely considered in the literature. However, without a theoretical treatment of the subject of utility functions for multi-goal systems, large system optimization is beyound our reach. Once we are to consider such systems, and to consider decisions which are of such significance that the resulting outcome may cause the system to move far from its existing state, then we must have a theoretical basis for developing analytic expressions for the system's utility. In other words, if operations research is to be able to consider total- versus sub-optimization, if it is to aid the decision-maker when he is faced with significant problems, then the operations researcher must have a well developed theory for developing utility functions for multi-goal systems. We hope that this paper will make some contribution to this important problem area. One of the most widely referred to methods for analytically considering multi-goal systems is the conversion of all the performance measures into one measure, usually into units of money1). We shall refer to this method as Method A. For example, with Method A, socalled 'transformations" are assumed to exist between the units used to describe customer goodwill, market share, sales level, time, etc., and the money units, thus converting all measures to their equivalent monetary values, while perhaps including some of the measures as constraints due to the difficulty of obtaining a proper transformation, or due to a notion that they are not important to the solution of the problem. If for each alternative course of action under consideration a joint probability density function for outcomes is developed, and if a utility function for money is developed, then the expected utility of each course of action can be obtained, and the alternative which maximizes the expected utility can be recommended as the optimal course of action. There: exists
another widely used procedure for evaluating alternative
1) See Ackoff, R. L., Scientific Method, John Wiley and Sons, Inc., New York, 1962. Side 259
that, for each performance measure, a function describes the contributionof that measure to the overall utility of the system, and this function is independent of the levels of all other measures2). We shall refer to this method as Method B. The simplest example of such a method is the use of constant weights to represent the relative contribution of each measure; this is the form of linear value functions such as are used in linear programming problems. If such a utility function, based upon the assumption of independence or seperability, is developed and if a marginal probability density function for each outcome is developed, the expected utility for each alternative course of action can be obtained, and the alternative maximizing the expected utility can then be recommendedas the optimal course of action. It is hoped that
the above comments on multiple -goal utility functions
I. Analytically
consider the calculation of expected utility and to
introduce 11. Examine the assumptions which underly Methods A and B, assumptions which the operations researcher might not always be willing to make were he aware of their implications. The mathematical proofs of the theorems presented are omitted from the paper due to considerations of space, and the desire to avoid confusing the non-mathematical reader. 111. Present a brief discussion of how, based upon the theorems presented, the operations researcher and the decision-maker, working together, may develop appropriate utility functions for operational decision-making in the "real world". I. The nature of the decision-making problem in a multi-goal system:We introduce
the notion of a vector (xi, xs, •••■, xm) as
representing 2) This assumption of independence or separability is frequently used by Fishburn, P. C, in Decision and Value Theory, to be published in spring 1964, by John Wiley and Sons, Inc., New York. Side 260
v= u (xi, X2, ■ ■■ ■, Xm) and that a change in any of the state variables can result in a change in utility to the "individual" whose m-\-l, dimensionalutility space, (xi, X2, ■••-, xm, v), is in question. "Individual" here might refer to a single person, or perhaps a group of people involvedin the decision, such as the managers of the various sub-systems belonging to the overall system to be optimized. We assume that
when a decision-maker must choose between several
We will begin by considering the simple case of only two relevant state variables, and extend the results to m variables at a later stage. We introduce the following notions which occur frequently throughout the development. 1. (xo, yo)
represents the initial state of the system. 2. The courses of
action available are c\, C2, ■■■•, Cj, ••••, cn.
3. To the ;'th course of action there is associated a joint probability density function dFj (x, y) which gives the probability that a course of action a will have the result that the state changes from (xo, yo) to (x,y). That is, dFj (x, y) =Pr [(x,y) (xo,yo),Cj]. 4. To each state (x, y) there can be assigned a relative value or utility, v = u(x,y). To define the scale of v, we assign some arbitrary value to the present state (xo, yo) and some other arbitrary value to another state, say (xi, yi). 5. We will
consider the following three spaces: Side 261
6. It is assumed
that for any of these surfaces, we can develop the
7. We define the
term Vertically Parallel in the following way: a. In vy space:
b. In vx space:
c. In yx space:
That is, two
parametric curves are V.P. if their slopes are equal.
Side 262
Horizontal
parallelism is an identical notion in the horizontal
8. We define a
trade-off function sas follows: Given any rj,
define e = Y (x — rj,v) — Y (x, v) = f (v, x, v) where
Associated with the notion of the trade-off function e is the notion of sliding along a particular indifference curve from a state (x, y) to a new state (x — r), j+e). The trade-off function then tells us how many additional units of ;y the decision-maker requires in order to remain on a particular indifference curve if he is to give up rj units of x. We define a special form of trade-off function, the Complete Trade- Off Functions, as ec = f (x—xo, x,v) = g (xo, x, v). This function specifies the amount sc of y which the decision-maker requires to keep him on the same indifference curve if we take rj = x— xo, that is, if we are to slide from the state (x, y) to the state (xo, 3> + «c). Clearly when x = xo, g (xo, x, v) = 0. Side 263
9. We define a "utility line" to be a particular cut through the utility surface, passing through the initial state (xo, yo). The utility line u (xo, y) represents the intersection of the utility surface u (x, y) and the plane x = xo. Similarly, the utility line u (x, yo) represents the line formed by the intersection of the utility surface u (x, y) with the plane y = yo. It should be noted that a utility line is what Ackoff refers to as a "value function"3). II. Calculation of Expected Utility, Method A (V.P. yxi Approximation):If the functional form of the utility surface were: known over the entire range of possible outcome states, (x, y), the computation and selection of the optimal course of action would be routine. To obtain this surface v = u (x, y), the decision-maker would be required to specify a relative value to each possible outcome state. Ideally, the operations researcher might attempt to help the decision-maker to construct such a surface through a series of questions constructed so as to elicit his value structure. For example, he might help the decision-maker to generate his indifference curves by judiciously choosing intervals of x and y and then forming iso-value curves via some sort of best fit procedure. However, the construction of the utility surface would require the decision-maker to place himself in many hypothetical outcome states with which he may have had little or no past experience. The conception of such states could be quite difficult for him. It would seem to be a logical statement that the further an outcome state (x, y) is from his initial state (xo, yo), the less certain would he be of its relative value; within a certain domain of outcome states near (;vo, yo), we might expect his estimates to reflect his true relative values fairly well. 3) Ackoff, R. L. Scientific Method, John Wiley and Sons, Inc., New York 1962, p. 42. Side 264
The difficulty is that, as yet, there does not exist a proven methodology for generating a utility space, even within a well defined region of the initial state, (xo, yo). However, extensive work has been done in the area of generating "utility lines" for operational problems; see Ackoff, Scientific Method*). We shall show that under certain conditions the utility lines can be used to generate the entire utility surface. In cases where the utility lines cannot be used to generate the entire utility surface, we may be able to use them to generate a surface, which, through subsequent modifications by the decision-maker, will lead to an approximation to the actual surface. Before outlining
the procedures for developing the utility surface
Theorem I. A
necessary and sufficient condition for the complete
where it (xo, y)
is the utility line for outcome 3; Corollary I.
Theorem 11. If u (x, y) is such that the indifference curves are vertically parallel, then there exists a unique transform between the units of the state variable x and the units of the state variable y, independent of y and given by or, equivalently,
Interpretation of
Theorems I and II: If we are
willing to make the assumption of vertically parallel
indifferencecurves, 4) ibid. Chapter 3. Side 265
ferencecurve)but in a plane such
that we know the utility line u (xo, y). Conversely, if we postulate that we can transform the units of state variables x into the units of state variable y by a transform function sc = h (xo, x), we are implying that the utility surface is of a special form, namely having the properties of V.P.yx and H.P.vy. The implications of the existence of such a transform, which does not depend on the state variable y, is that the state variable x is important to the decision-maker only because it can be converted directly into units of state variable y. If the importance of x is dependent on the level of y, or if, as will be shown later, the decision-maker does implicitly or explicitly value x with respect to other state variables in the system, then the indifference curves cannot beVV.Pyx. yx and the unique transform of x into units of y cannot be made. Illustration:
Suppose the
decision-maker is the president of a firm and we define
Then, the existence
of a complete transform function sc (same for all
levels of y) implies that the market share is of importance to the decisionmakeronly in so far as it affects profit rate. In many cases this could quite possibly be true, particularly in a region not too far removed from the initial state, (xo, yo). If, however, the value the; decision-maker assigns to an increase in market share is dependent upon the level of the profit rate, or if he also values an increase in market share because it enhances Side 266
his prestige and for self-satisfaction, or if it bears some relationship to any other component of what we might abstractly consider as his entire value space, then his indifference curves will not be parallel, and no such transformation will exist. This is not to say that there is no relation between percent of market and profit rate. In fact, if sufficient data is available such a relationship can often be established statistically (see for example Ackoff, Scientific Method, pp. 7 8-79)5). The importance of the foregoing theorems is that such a relationship is equivalent to ec only when the indifference curves are vertically parallel. Interpretation of
Parallelism in m-dimensional Value Space: In general, we may consider a decision-maker as having a value space consisting of state variables xi, X2, ■■••, xm, y where yis used to signify some important measure, usually in units of money (e. g., profits, costs, etc.). We can extend Theorem I to the case of m dimensions as follows: Theorem 111. A necessary and sufficient condition for the existence of a set of complete trade-off functions (or transform functions) ejc = Jij (xjo, Xj), j = 1, 2, • • -, m, which transform units of Xj into equivalent units of y, is: where xjo is the
initial value of state variable Xj, j' = 1, 2, . .. m,
and Similarly, in
analogy to the earlier corollary and theorem in three
Corollary I.
Theorem IV.
V.P. yx.
z^sic =hi (xm, Xi), i= 1, 2, ... m 5) ibid. Side 267
Summary of
Theorems I-IV: We now have
developed the necessary theoretical concepts to generate
The expected
utility of the j/th course of action is thus We can now refer
to Method A as V.P. yx. approximation Method B
(V.P. Vx. Approximation): When, for one reason or another, V'.P.yx. approximation does not give a good fit (i. e., the indifference curves are not parallel), a second type of approximation is to assume vertical parallelism in all vx% spaces, Xl, X2, • •••, Xm. dy Side 268
Thus where u (xio, •
■■ xi, . .. Xmo) is the utility line for xt
generated with Hence the
following theorem: Theorem V. A
necessary and sufficient condition for V.P.vx., i= 1,
where Mi {xi) is
the marginal utility of xt and is given by Comments on
Theorem V: The V.P. vx. condition might be described as a "separability" or "independence" condition where the value contributed by each state variable depends in no way upon the levels of other state variables. It is difficult to conceive of such a relation holding over the entire utility space. However, it may give a good approximation within a decision domain near the initial state (#io, • • • ■, xmo). If we assume the V.P. vx. condition then the calculation of expected utility is greatly simplified for we no longer require a joint density function as with V.P. yx. conditions, but need only the marginal probability density functions, dF% (xi). That is, for a given course of action, Side 269
Combination
of V.P.;!,-. and V.P. <■*. Approximations:
It is interesting to note the effect of assuming vertical parallelism in both yx space and vx space, or equivalently, assuming both the one-toone transformation functions and the "independence" of state variables. It turns out that both assumptions combined imply a linear utility curve in y. Hence, we no longer have a problem in value theory. In fact an even
weaker condition, namely, V.P.vy, when combined Theorem VI.
The combined properties of V.'P.yx., i = 1, 2, ■ ■ • •,
m Corollary 11. Under combined properties of Theorem VI, the utility v is just the sum of initial state value, vo and the marginal utilities of y and of Xi which in this case are identical with the transform functions sic, i = 1, 2, • • • •, m. Comments on
Theorem V: From the point of view of value theory, applying the assumptions of Theorem VI results in a degeneracy. Under the assumptions of this theorem, if we for the moment assume that y represents profit, then we maximize expected utility by maximizing expected profit, taking into account that we transform all other state variables into their equivalent profit. This is the same as saying that we assume that the decisionmaker's utility line for profit is linear. It will be noted that in most applications of operations research, "intangibles", such as shortages in inventory problems, machine down-time in replacement, customer waiting time in queueing, etc., are transformed into equivalent costs and revenues, and then the "optimal" policy is chosen as that which maximizes expected profit. Thus, when they follow such a procedure, the operations researchers implicitly assume that the decision-maker's utility line for money is linear! Side 270
We can now formally state, in terms of value theory, the implicit assumptions we are making about the nature of the utility space when we follow these standard procedures; namely, we assume vertical parallelism of all indifference curves, vertical parallelism in all vxt spaces, and horizontal and vertical parallelism (hence linearity) in vy space. III. A Suggested Procedure for the Practical Establishment of Utility Surfaces.The motivation behind this section is to show how, using the theorems presented earlier, the operations researcher may help the decision-maker to establish his appropriate utility surface. The general outline of the procedure is to consider whether: Method A (i.e.,
V.P.yx.) is an appropriate assumption Method B or
"independence" of utilities (i.e., V.P. .-./■.) is
In the discussion
that follows, we restrict ourselves to the case where
By considering only two state variables, the presentation is simplified and we can take advantage of graphic demonstrations. Many of the results can be extended into a general re-dimensional state space but this is beyond the scope of this section. The assumption is made that it is difficult for the decision-maker to describe his feeling about relative values, trade-offs, etc., when he is removed from his present state, described by xo, yo and you =u (xo, yo). Therefore, we attempt to develop a procedure for eliciting information about the actual utility space by continually allowing the decisionmaker to refer to at least one of xo, yo or vo. Depending on
the nature of the problem at hand, the operations
6) To select relevant state variables, the partial derivates of v with respect to these variables must be different than zero. That is, if x, y and u (x,y) are sufficient to describe the value spacer over some reasonable decision-domain, the decision-maker considers that du — (u, y,z)=o oz for any other state variable z. Side 271
more desirable. It is likely that if he is considering many state variables, he would wish to check first for V.P.vx (or V.P.vy) parallelism as this form of parallelism would permit him to use marginal density functions for x and y when calculating expected utility of an action. On the other hand, he may wish to have yx parallelism when dealing with a state space described by only a few state variables. (A) Determining
appropriateness of an assumption of pcirallelism in
It is clear from the theorems that if there exists V,P.yx then there exists a unique complete trade-off function which can be used to translate units of x into units of y, independent of the actual magnitude of the state variable y (or y into units of x in the case of V.P.xy). Thus, if such transformations are considered as proper, the value space describable by x, y, u (x, y) can be equally described by y', u (xo, y'), where y = y~\-h (xo, x), which is far easier to generate than the former space. This becomes all the more true when the number of state variables increases. However, when the number of state variables becomes large, the assumption of "independence" is more desirable mathematically than that of yx parallelism; the latter requires the use of a joint density function associated with each course of action in order to determine expected utility of action, while the former requires only the marginal density functions. The following
steps are suggested to determine the appropriateness of
1. Generate the
indifference curve t>o which passes through (xo, yo).
of different
states of x and. asking him to select the corresponding
Side 272
at (xq, yo) or
moving to (x, y), a state having the same value, vo,
2. Hold x at xo and generate u (xo, y) by the Case Method or some other suitable method. Assume a value for vo, say vo equal 100, and assume a value for some other state (xo, yi) say vi equal 200. This is sufficient to uniquely define a scale of relative values for the utility line u (xo,y). See Figure 2 below. 3. Holding y at yo, determine the states (x, yo) which correspond to the same relative value as the points (xo, yi), (xo, yz) These points can be obtained by asking the decision-maker what value of xi he would require to be indifferent between being at the state (xo, yi) and the state (xi, yo), etc. This permits determination of u (xi, yo), etc. See Figure 3 for the resulting points of equal value for vs = u (x, yo). 4. Check for parallelism in yx space. This can be done by sliding the indifference curve for vo upward and downward and seeing how well it passes through iso-value points. If it passes through these points reasonably well, we can assumeVV.Pyx..yx. (Similarly we check for H.P.yx^ V.P.xy). In order to determine the "goodness of fit" various Side 273
procedures might
be used, including the "eyeball" method, or more
5. If yx parallelism exists, this implies that there exists a unique complete trade-off function and all that is required for decision-making which is consistent with the objective of maximizing expected utility exists in the form of the complete trade-off function sc, the joint probability distribution associated with a course of action, and the utility line v = u (xo, y). (B) Determine the
appropriateness of the independence assumption: 1. From (A) we have developed both u (xo, y) and u (x, yo), the so-called utility lines. Furthermore, these curves are appropriately scaled with respect to each other via steps (A—2, 3). We now develop an analytic expression for each of these curves, perhaps in terms of polynomials, etc. 2. Assuming the property of independence we then derive the mathematical form of the indifference curve with v equal vo. That is, we solve analytical or numerically for the values of x and y which will be such that vx-\-vx-\-vy equal vo. 3. Compare this plot of y versus x with that graph of y versus x obtained in step A-l (see Figure 1). It will be recalled that that plot of the indifference curve vo was obtained via a direct questioning of the decision-maker. 4. If, based upon some statistical or other form of test procedure, it is concluded that the agreement between these curves is sufficient to allow the assumption of independence (i. e., V.P,,rx.), then, given the marginal distribution of x and y for action cn we can immediately calculate the expected utility of choosing the course of action as where Rx and Rv represent the decision domain for x and y, and where cIFn (x) and dGN (y) represent the marginal density functions of x and y respectively with the Nth course of action, cjv, and where vx and Vy are the utility lines established from steps A-2 and A-3. Side 274
(C) If neither
assumption of parallelism nor independence is
acceptable: If neither of the assumptions appear at first to be acceptable by the test criteria, then the decision-maker and the operations researcher must generate the value space v = u (x, y), either by placing the decision-maker in a large number of hypothetical situations (i. e., states) and then attempting to assign relative values to each such state, or else by modifying the utility surface which would be generated under either assumption, until it is acceptable to the decision-maker. The first of these alternatives will be a very difficult procedure, even in the case of only two relevant state variables, and almost impossible with more than two such state variables. (Clearly then, before the two assumptions are both rejected, rejection must be based upon some economic considerations of the cost of error, cost of additional investigation, etc. It has not been our goal to consider the test procedures used in accepting or rejecting the assumptions, but this is a significant problem if the above approaches are to be followed). The second of the two alternatives, that of modifying the utility surfaces which would be generated under either the assumption of yx parallelism or of independence, is therefore a much more appealing alternative. Certainly the fact that the decision-maker and the operations resarcher could have originally rejected the notions of parallelism or independence, supplies information which can be used to modify the surface so as to more closely represent the actual, but as yet unknown, utility surface. Furthermore, it may be found that one or the other of these assumptions hold in part of the decision domain and thus permit the description of this part of the surface with information already available. This could then provide a lead into determining how the remainder of the surface should be changed so as to more accurately represent the decision-maker's true utility surface. |