Download New formulation of minimum-bias central composite

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts

Mathematical optimization wikipedia , lookup

Multidisciplinary design optimization wikipedia , lookup

Transcript
Research Paper
Struct Multidisc Optim 28, 231–242 (2004)
DOI 10.1007/s00158-004-0433-9
New formulation of minimum-bias central composite
experimental design and Gauss quadrature
X. Qu, G. Venter and R.T. Haftka
Abstract Response surface methods provide a powerful
tool for constructing approximations to complex response
functions. Statistical design of experiments is usually
used to select optimal points that minimize the error in
the resulting response surface approximation. Traditionally, data points are selected using minimum-variance
designs, for example the D-optimal design, which may
result in large bias errors for low-order approximation.
Minimum-bias criteria have been developed for selecting data points to minimize the bias error of a response
surface approximation. The present work developed
a minimum-bias counterpart to the popular minimumvariance central composite designs. In addition, a new
formulation of the minimum-bias design that assigns
unequal weights to the design points, based on Gauss
quadrature, is explored. Example problems are evaluated
and the results obtained from D-optimal, the traditional
minimum-bias, and the new Gauss-quadrature-based
minimum-bias designs are compared. It is shown that the
Gauss-quadrature-based minimum-bias design criterion
results in the most accurate approximations and provides
analytical solutions to a wider range of approximation
domains than the traditional minimum-bias design. Response surface approximations based on minimum-bias
central composite designs are more accurate than those
constructed from traditional central composite design.
Moreover, it is shown that using weights in regression
has little influence on the accuracy of the response surReceived: 28 July 2003
Revised manuscript received: 16 April 2004
Published online: 27 July 2004
 Springer-Verlag 2004
X. Qu1, u , G. Venter2 and R.T. Haftka1
1
Dept. of Mechanical and Aerospace Engineering, University
of Florida, Gainesville, FL 32611-6250, USA
e-mail: [email protected], [email protected]
2
Vanderplaats Research and Development, Inc, 1767 S 8th
Street, Suite 200, Colorado Springs, CO 80906, USA
e-mail: [email protected]
Preliminary version of the paper presented at the Fifth
World Congress on Structural and Multidisciplinary Optimization, Venice, Italy, May 19–23, 2003
face approximation in Gauss-quadrature minimum-bias
designs.
Key words central composite design, Gauss quadrature,
minimum-bias design of experiment, response surface approximation
1
Introduction
Response surface methods are used to construct simple approximations to the response of complex systems.
These response surface approximations are used in a large
number of problem areas, including engineering optimization and reliability evaluations (Kaufman et al. 1996;
Venter et al. 1997; Qu et al. 2003). Response surface approximations are generally smooth, low-order polynomials and have the desirable property of eliminating numerical noise, which is inherent in most engineering computer
simulations (e.g. Giunta et al. 1994; Venter et al. 1997).
The designer is also presented with a global view of the
response over the design space (e.g. Mistree et al. 1994),
and the process of integrating several design codes, as is
typically required in the multidisciplinary optimization
process, is greatly simplified (Kaufman et al. 1996; Mason
et al. 1998).
Statistical design of experiments leads to a more efficient construction of the approximations by identifying
a set of points at which to evaluate the response function, selected to minimize the effect of variance (noise)
on the resulting approximation. For regularly shaped approximation domains, there are a number of standard
designs (i.e. sets of data points) available, for example
the central composite (Box and Wilson 1951) and the
Box–Behnken (Box and Behnken 1960) designs. For an
irregularly shaped approximation domain, a computergenerated design must be used, such as the most widely
used D-optimal design (e.g. Myers and Montgomery 1995,
pp. 364–366). The D-optimal design minimizes a measure of the variance associated with the estimates of the
coefficients of the response surface model. Variance-based
232
criteria assume that all errors associated with a response
model are random errors and do not address bias or modelling error due to an insufficient response model.
However, the exact functional form of the response
function to be approximated is rarely known, and loworder polynomials are generally used as response models,
often leading to large bias errors. For regularly shaped approximation domains (cuboidal or spherical regions), fullfactorial minimum-bias designs (Myers and Montgomery
1995, pp. 406–421; Box and Draper 1987) are available.
Balabanov et al. (1996) successfully applied these designs to find a minimum-bias design for a 25-dimensional
sphere. For an irregularly shaped approximation domain,
Venter and Haftka (1997) used a genetic algorithm to select a specified number of design points from a candidate
set of points (similar to the D-optimal design criterion).
The genetic algorithm selects the data points that best
satisfy the minimum-bias conditions. These minimumbias design of experiments are based on minimizing averaged bias errors. Papila et al. (2004) developed a design of experiment to minimize the maximum bias error
using both point-independent and point-dependent error
bounds. Polynomial examples of two variables were provided to demonstrate the methodology.
The first objective of the present work is to explore
a new minimum-bias formulation based on Gauss quadrature (e.g. Nakamura 1991; Burden and Faires 1989) where
unequal weights are assigned to the design points in order
to improve the accuracy of the resulting response model.
Gauss quadrature optimizes both the location and the
weights associated with the integration points to maximize the order of the polynomial that can be integrated
exactly. The Gauss-quadrature integration points and associated weights are a natural choice for minimum-bias
designs, since the conditions that minimize the bias error
may be written in a form that is equivalent to finding
a numerical integration scheme capable of exactly integrating a polynomial of predefined order.
Central composite design (CCD, e.g. Myers and
Montgomery 1995, p. 55) is a popular statistical design
of experiment intended to minimize variance errors. CCD
usually requires fewer points than the Gauss-quadrature
minimum-bias design. Motivated by the use of weights
in Gauss-quadrature minimum-bias design, the second
objective of this work is to develop the minimum-bias
equivalent of CCD, with and without the use of weights
associated with design points. The effects of using weights
in constructing designs of experiments and response surface approximations on the accuracy of the resulting
response surface approximation are also investigated.
Section 2 presents minimum-bias design of experiments with weights. Gauss-quadrature-based minimumbias formulation is proposed in Sect. 3, and applied to
a tubular column example problem in Sect. 4. Section 5
studies the effects of weights in Gauss-quadrature-based
minimum-bias design. Section 6 develops minimum-bias
central composite design and investigated the effects of
weights in constructing design of experiments and regres-
sion. The advantages of minimum-bias central composite
design are demonstrated with a beam optimization example in Sect. 7.
2
Minimum-bias design with weights
2.1
Regression with weights
Response surface methods are used to construct an approximate relationship between a dependent variable f
(the response) and a vector x of n independent variables
(the predictor variables). The response is generally evaluated experimentally (these experiments may be numerical in nature), in which case f denotes the mean or expected response value. It is assumed that the true model
of the response may be written as a linear combination of
β . The
given functions z̃ with some unknown coefficients β̃
experimentally obtained response values y differ from the
expected value f due to random experimental error δ as
follows:
β +δ
y(x) = f (x) + δ = z̃(x)T β̃
(1)
Since the exact dependence of f on x is generally unknown, a response model is used to approximate f (x) as
follows:
y(x) = z(x)T b + ε
(2)
In (2), z(x) contains the assumed functions in the response model and the coefficients of the response model
are b, estimated from experimentally obtained response
values, a process known as regression. Furthermore, ε denotes the error, which is the difference between the predicted and measured response values and includes both
random (variance) and modelling (bias) error. For N design points, (2) may be written in matrix form as:
y = ZB +εε
(3)
The resulting response model
ŷ(x) = z(x) T b
(4)
where ŷ denotes the predicted response values obtained
from the response model.
The unknown coefficients may be estimated by
a weighted least-squares procedure that minimizes the
L2 -norm of the model error, defined as (e.g. Strang 1986)
ε2 = (y − Zb)T WT W(y − Zb)
(5)
where N denotes the number of experimental data points
used in constructing the response model, wi denotes
the weight associated with each design point, and W is
233
a diagonal matrix. The approximate values of the unknown coefficients that minimize ε2 may then be solved
by setting the derivative of ε2 to zero, to obtain
−1
T
T
b = (WZ) WZ
(WZ) Wy
(6)
2.2
Bias and minimum-bias design of experiments
For many engineering applications of response surface approximations, the data are from computer simulations.
The random error in the data is small, but there can be
substantial modelling errors. Model bias occurs when an
insufficient response model is used to approximate the response of a system. Assuming that the true (or exact)
model is a polynomial of higher order than the response
model, the true model may be written as
f (x) = E(y(x)) = z1 (x)T β 1 + z2 (x)T β 2
(7)
where E denotes the expected or mean value, z1 contains the monomials of the true model that is contained
in the response model, while z2 contains the higher-order
monomials of the true model that are not included in the
response model. Furthermore, β 1 and β 2 are the exact,
but unknown, coefficients of the true model associated
with z1 and z2 , respectively. Using the same notation, the
corresponding response model may then be written as
ŷ(x) = z1 (x)T b1
The MSE depends on the design points considered over
the approximation domain, R, and the average mean
squared error (AMSE)
AMSE =
NK
σ2
2
E [ŷ(x) − E ŷ(x)] dx +
R
APV
2
[E ŷ(x) − f (x)] dx , where K = R
1
dx
(11)
R
ASB
and N/σ2 (number of observations divided by the variance of the model) is a mere scale factor. As shown
in (11), the AMSE has two components, namely the average prediction variance (APV) and the average square
bias (ASB). Only the ASB term is considered in the
derivation of the minimum-bias conditions. Using the
above defined quantities, the ASB of (11) becomes
NK
ASB = 2
σ
T T
β 2 −2β
β T2 z2 zT1 Aβ
β2 +
β 2 A z1 zT1 Aβ
R
β T2 z2 zT2 β 2 dx
(12)
At this point, it is convenient to introduce some additional notation as follows:
(8)
Moment matrices, M11 and M12 :
where b1 is a biased estimator of β 1 ( a biased estimator
is one that has a mean value not equal to the estimated
quantity), with its expected or mean value given by
β 2 , where A =
E(b1 ) = β 1 + Aβ
−1
(WZ1 )T WZ2
(WZ1 )T WZ1
M11 =
(WZ1 )T WZ1
N
wi
i=1
2
MSE = E [ŷ(xi ) − f (xi )] =
M11 =
ZT1 Z1
N
(13)
i=1
2
ZT1 Z2
N
(14)
z1 zT1 dx
µ 11 = K
R
z2 zT2 dx
µ 22 = K
E [ŷ(xi ) − E ŷ(xi )] + [E ŷ(xi ) − f (xi )]
M12 =
Region moment matrices µ 11 , µ 22 and µ 12 :
2
SquaredBias
(WZ1 )T WZ2
N
wi
For the traditional minimum-bias conditions where all
design points have an equal weight of one, the moment
matrices of (13) reduce to:
E {[ŷ(xi ) − E ŷ(xi )] + [E ŷ(xi ) − f (xi )]} =
2
M12 =
(9)
and Z1 and Z2 are the matrices corresponding to z1
and z2 .
Both Myers and Montgomery (1995, pp. 406–421) and
Box and Draper (1987) show the traditional derivation
of the conditions that minimize the average squared bias
(ASB) of a response model when all design points have
an equal weight of one (without weights). Following the
derivation of Myers and Montgomery, the more general
derivation where unequal weights are assumed is presented here. The derivation is based on the fact that the
variance and bias error of a response model at a specific
data point, say xi , may be combined to obtain the mean
squared error (MSE) as follows
Variance
NK
σ2
R
(10)
z1 zT2 dx
µ 12 = K
R
(15)
234
The vector α 2 :
α2 =
√
β 2 /σ
Nβ
(16)
Making use of this new notation and some algebraic manipulation, (12) may be written as
µT12µ −1
ASB = α T2 µ 22 −µ
11 µ 12 +
−1
T
µ−1
µ−1
M11 M12 −µ
µ 11 M−1
α2
11 µ 12
11 M12 −µ
11 µ 12
(17)
where µ 11 is positive definite. A sufficient condition for
a minimum-bias design is then to have:
M11 = µ 11
M12 = µ 12
(18)
Note that in order to evaluate (18), the true model must
be known; however, this is seldom the case, and we assume that the true model is a polynomial of higher order
than the response model. Denoting the order of the true
model by d2 , we say that the response model is protected
against a true model of order d2 . It is usually assumed
that the true model is a polynomial one order higher that
the response model.
3
Gauss-quadrature-based minimum-bias formulation
Equation (18) essentially means that minimum bias is
achieved when the integrated moments are equal to the
corresponding sums. Gauss quadrature appears to offer a way to meet these conditions more easily by using
weights. That is, for the minimum-bias design, we can
vary not only the location of the data points but also
the weights associated with them. The Gauss-quadrature
formulation of the minimum-bias design is based on writing the minimum-bias conditions of (18) in the following
equivalent form:
N
Θ dx
wiΘ i
R
=
dx
R
i=1
N
(19)
wi
i=1
where Θ denotes all monomials up to degree (d1 + d2 )
with d1 denoting the order of the response surface model
and d2 denoting the order of the true model. For example,
if the response surface is linear and the true model is
quadratic, then Θ denotes all monomials up to cubic.
In order to satisfy the minimum-bias conditions of
(18), (19) must hold for all monomials Θ . This is equivalent to finding a numerical integration scheme with N integration points that is capable of exactly integrating any
polynomial up to the degree (d1 + d2 ). In the traditional
minimum-bias approach, all weights are assumed equal
and only the location of the integration points may be optimized. However, when positions and weights associated
with the integration points are optimized, we can obtain
an integration scheme capable of integrating higher-order
polynomials exactly. In particular, Gauss quadrature can
integrate the highest possible order polynomial using n
data points.
Gauss quadrature is discussed in detail in a number
of numerical analysis textbooks, for example Nakamura
(1991) and Burden and Faires (1989). One-dimensional
Gauss quadrature with N integration points may be written as
+1
N
f (x) dx ≈
wi f (xi )
−1
(20)
i=1
where xi denotes the ith integration point and wi denotes the weight associated with that point. The Gauss
quadrature of (20) can integrate exactly polynomials up
to degree (2N − 1). For one dimensional problem, it may
be shown that Gauss quadrature of N points as a design
of experiment protects response surface approximation
against true models of the order of 2N − 1 − d1 . For higher
dimension, the number of Gauss-quadrature points required depends on the dimension of the problem and the
number of points along each dimension. Gauss quadrature is very popular in numerical analysis procedures
and is, for example, an integral part of the finite element method (e.g. Cook et al. 1989, pp. 170–173 and pp.
183–185). Due to the popularity of Gauss quadrature,
integration points and associated weights are tabulated
in a number of books (e.g. Stroud and Secrest 1966).
Gauss quadrature also has the advantage that any two or
more integration domains for which a Gauss quadrature
exists may be combined to obtain higher-order and/or
more complicated integration domains. This means that
closed-form solutions are available to a much wider range
of approximation domains as compared to the traditional
minimum-bias design.
4
Tubular column example problem
The first example problem approximates the buckling
load safety margin of a simply supported tubular column, and is an extension of the problem considered by
Venter and Haftka (1997). A quadratic response model
was constructed using a D-optimal minimum-variance design (D-Opt), the genetic-algorithm-based equal-weight
minimum-bias design of Venter and Haftka (1997) (EQMinBias), a Gauss-quadrature minimum-bias design with
weights (GQ-MinBias), and the same Gauss quadrature
minimum-bias design without weights (GQ-NoWeight).
The example problem is shown schematically in Fig. 1.
The corresponding buckling load safety margin, which is
defined as the difference between the critical Euler buck-
235
Fig. 1 Schematic representation of tubular column and optimal design points for the tubular column example
ling load and the applied load, σc , expressed as a stress, is
presented in (21).
σC (D, T ) =
π2 E 2
P
(D + T 2 ) −
8l2
πDT
(21)
In (21), E is Young’s modulus of the material, P is the applied load, and l, D, and T are the length, diameter, and
wall thickness of the column. In the present work, the approximation domain is defined in (22), which represents
a triangular region, as shown in Fig. 1.
D ≤ 5.0 [in] T ≥ 0.05 [in]
D
≥ 10
T
(22)
The buckling load safety margin was approximated
using a quadratic (six-parameter) response model with
twelve design points for estimating the unknown coefficients of the model. As in Venter and Haftka (1997),
a set of 231 evenly spaced data points were considered as candidate points for the D-optimal- and the
genetic-algorithm-based minimum-bias designs. For the
two Gauss-quadrature-based minimum-bias designs, the
same set of Gauss-quadrature integration points defined
for a triangular region (e.g. Cook et al. 1989, p. 184)
were used as design points. For a triangular integration
domain, the twelve-point Gauss quadrature integrates
a sixth-order polynomial exactly. The resulting Gaussquadrature-based minimum-bias design with weights
thus protects the quadratic response model against
a sixth-order true model. The corresponding genetic algorithm minimum-bias design protects the quadratic response model against a third-order true model.
The optimal data points obtained from the three design criteria are shown graphically in Fig. 1. The coordinates and associated weights of the Gauss-quadrature-
Table 1 Optimal data points (Gauss-quadrature minimum-bias design, equal-weight minimum-bias design, and D-optimal design)
D
1.0678
4.7161
4.7161
2.7436
3.8782
3.8782
2.1357
2.1357
3.6034
4.7608
3.6034
4.7608
Gauss quadrature
T
Weight
0.0784
0.0784
0.4432
0.1622
0.1622
0.2756
0.0739
0.1897
0.0739
0.1897
0.3364
0.3364
0.050845
0.050845
0.050845
0.116786
0.116786
0.116786
0.082851
0.082851
0.082851
0.082851
0.082851
0.082851
EW-MinBias
D
T
1.400
1.850
2.300
3.200
3.200
3.650
4.100
4.100
4.325
4.325
4.775
4.775
0.095
0.073
0.163
0.118
0.253
0.298
0.185
0.365
0.073
0.208
0.163
0.410
D-Opt
D
T
0.500
0.500
2.750
2.750
2.750
2.750
5.000
5.000
5.000
5.000
5.000
5.000
0.050
0.050
0.050
0.050
0.275
0.275
0.050
0.050
0.275
0.275
0.500
0.500
236
Table 2 Response models: the design for the EW-MinBias was obtained by genetic optimization of the bias error (Venter and
Haftka 1997) while the GQ-NoWeight design used the Gauss quadrature points in Table 1
Parameter
Intercept
D
T
D2
T2
DT
D-Opt
−85 501.72
38 618.08
80 287.87
−3396.26
−45 066.00
−8418.12
EW-MinBias
GQ-MinBias
GQ-NoWeight
−24 925.67
6709.62
65 240.90
584.85
−73 805.03
−4069.21
−31 491.98
10 317.55
72 853.69
146.79
−56 968.98
−7635.65
−32 781.23
11 088.47
71 224.10
48.03
−51 157.99
−7700.455
based minimum-bias design points obtained from Cook
et al., p. 184 (1989), and the equal-weight minimum-bias
design points obtained from the genetic algorithm and
D-optimal design points given by Venter and Haftka
(1997) are presented in Table 1. The response models
obtained from the data points shown in Table 1 and
Fig. 1 are summarized in Table 2. A response model based
on Gauss-quadrature minimum-bias design but without
using weights in regression is also constructed in order to
investigate the effects of weights on the accuracy of the
constructed approximation.
The predictive capabilities of the respective models
were evaluated based on the independent list of 231
evenly spaced candidate points. The average, RMS (root
mean square), and maximum values of the errors, as well
as the G-efficiency (see Appendix) were calculated and
the results are summarized in Table 3.
The models obtained from the minimum-bias designs
are more accurate (except for the maximum error) than
the corresponding models obtained from the D-optimal
criterion. Both Gauss-quadrature-based minimum-bias
designs outperformed the more traditional, geneticalgorithm-based minimum-bias design except that the
later is slightly better in terms of the averaged error.
Apart from being more accurate, the Gauss-quadraturebased designs were obtained from an analytical solution,
while the traditional designs were obtained by making
use of a computationally intensive genetic algorithm. The
G-efficiency is a measure of the maximum prediction variance of a response model and as such is a variance-type
criterion of the predictive capabilities of a model. As expected, the Geff of the D-optimal design is better than
that of the corresponding minimum-bias designs. Note
that the Gauss-quadrature-based minimum-bias designs
resulted in significantly higher Geff values that the corresTable 3 Predictive capabilities of the quadratic response
models
Design
D-Opt
EW-MinBias
GQ-MinBias
GQ-NoWeight
%AveErr
34.14
8.67
9.23
9.47
%RMSE %MaxErr
49.79
25.82
22.88
22.37
183.28
284.67
255.90
249.58
Geff
100.0%
11.5%
22.2%
23.65%
ponding traditional minimum-bias design. This indicates
that the Gauss-quadrature design is more efficient in reducing the variance error of a model, because the Gaussquadrature-based design selects design points closer to
the perimeter of the design space. An unexpected result
is that, even without using weights, the Gauss points
still perform comparably to that of the Gauss points
with weights. There is almost no difference in the performance of the Gauss-quadrature-based minimum-bias
design with weights and that without weights.
5
Effects of weights in Gauss-quadrature-based
minimum-bias design
The minimal effects of weight on the accuracy of the
approximations prompted us to study a simple quartic function y = x4 . Employing a four-point GQ design,
two response surfaces were built to approximate y = x4
using Gauss-quadrature minimum-bias design with and
without weights. The two response surfaces were exactly
the same, thus of the same accuracy. For a five-point
GQ design, (xi , wi ) = (0, 0.5689), (±0.5385, 0.4786) and
(±0.9062, 0.2369), the two response surfaces are no longer
the same, as shown in Table 4, but the difference in accuracy is minimal. It is worth noting that the RMSE predictor for the case with weights is based on the weight matrix
being the matrix of covariance of the error. Since we are
using weights from minimum-bias design, the RMSE predictor is no longer meaningful for the case with weights.
The true root mean square error (TRMSE) of the response surfaces is calculated exactly via analytical integration with MATLAB.
Weights associated with points in the design of experiments have two contributions in response surface approximations. One is to be used in regression analysis to solve
for the unknown coefficients in the response surface approximation. The results of the tubular and polynomial
examples indicate that using the weights of the Gaussquadrature minimum-bias design contributes little to improving the accuracy of the response surface approximation. The second contribution of weights is that weights
change the location of points in the resulting design of experiments, which is investigated in the next section.
237
Table 4 Comparison of statistics of quadratic response surfaces built from five-point GQ minimum-bias design with and without
using weights for test function x4
GQ Min-bias
GQ Min-bias without weights
Test function
−0.1016 + 0.9111x2
0.8956
0.3034
0.0779
0.4
y = x4
N/A
0.2
0
0.4
RS
−0.08572 + 0.85714x2
2
R Adj.
0.8367
Mean of response
0.3034
TRMSE (exact)
0.0762
Integrated x4 over [−1, 1]
0.4
N
6
Minimum-bias CCD
i=1
While the finite-element literature provides useful Gaussquadrature data for use as experimental designs, there is
no minimum-bias equivalent for the central composite design (CCD, e.g. Myers and Montgomery 1995, p. 55). The
CCD is a popular minimum-variance design that is useful for low dimensions. The number of points in CCD is
determined by (2n + 2n + 1), where n is the dimensionality of the problem (n = 3 to 6 is studied here). To protect quadratic polynomials against quartic polynomials,
Gauss-quadrature minimum-bias design needs 3n points,
which is more than that required by CCD. We decided
to develop a minimum-bias CCD to provide an attractive alternative to the GQ-based minimum-bias design
presented here. Minimum-bias CCD with and without
weights are developed for three to six dimensions, and the
contribution of the weights in constructing the design of
experiments is also investigated.
For minimum-bias CCD with weights, there are only
five design variables: the weight of the center point, the
weight and position of the vertices, and the weight and
position of the axial points. In n dimensional space, if
a quadratic response surface is employed and the true
function is a quartic, according to the minimum-bias design condition shown by (18), the following seven equations need to be satisfied by the coordinates xi and the
weights wi of the N points in the CCD
N
wi = 1 , N = 2n + 2n + 1 ;
i=1
N
i=1
N
i=1
1
wi ;
5 i=1
N
wi x4i =
N
1
wi ;
3 i=1
N
wi x2i =
1
wi ;
7 i=1
N
wi x6i =
i=1
N
1
wi ;
9 i=1
N
wi x2i u2i =
1 wi ;
27 i=1
i=1
−1 ≤ xi , ui , vi ≤ 1, 0 ≤ wi
(23)
where xi , ui , and vi are any three coordinates of the
points. The weights and coordinates of the points in the
above equations are substituted according to the position
of the points (center, vertices and axial points). Because
there are more equations than unknowns, the least-square
solver in MATLAB was used to solve the seven equations of (23). The solutions for three to six dimensions are
shown in Table 5. For comparison, CCDs without weight
(equal weights) were also obtained and shown in Table 6.
It is seen that while the L2 norm of the residual of (23) is
much higher without weights, the difference in positions
is small. It can be seen that almost all the designs have
points on the faces of the box, but the vertex points are
much closer to the center than in the standard CCD.
Using the minimum-bias CCD in 6-D, a quadratic
response surface is constructed to approximate a full
quartic polynomial with all coefficients equal to one.
To investigate the efficiency of minimum-bias CCD, two
other quadratic response surfaces were fitted to the test
function using a regular CCD and a face center central composite design (FCCCD, e.g. Myers and Montgomery, 1995, p. 313). Since the RMSE predictor is no
longer meaningful for the case with weights, the RMSE of
the response surfaces are compared by using test points.
The true root mean square error (TRMSE) is calculated
exactly via analytical integration with MATLAB, and
also approximated using 46 656 structured test points
that were chosen as a uniform 66 grid, and 46 656 ran-
n
XAxial
XVertex
WeightAxial
WeightCenter
WeightVertex
L2 Norm∗
3
4
5
6
1
1
1
1
0.6781
0.6781
0.6781
0.6934
0.04512
0.04512
0.04512
0.04119
0.2016
0.1113
0.02106
0.0
0.06591
0.03296
0.01648
0.007893
0.02109
0.02109
0.02109
0.02429
Residuals of (23)
i=1
1 wi ;
15 i=1
N
wi x2i u4i =
N
wi x2i u2i vi2 =
Table 5 Numerical solutions to minimum-bias CCD of 3, 4, 5 and 6 dimensions
∗
N
238
Table 6 Numerical solutions to minimum-bias CCD without
weights of 3, 4, 5 and 6 dimensions
n
XAxial
XVertex
L2 Norm∗
3
4
5
6
0.9313
1.0
1.0
1.0
0.6634
0.6488
0.6462
0.6390
0.4726
0.7939
2.6194
6.5138
∗
Residuals of (23)
domly generated points uniformly distributed in the design space.
The accuracy of the four response surfaces is compared in Table 7 for a 6-D quartic polynomial. It is seen
from the TRMSE that the accuracy of the response surface approximation based on minimum-bias CCD is the
best, and the difference between the designs with and
without weights is minimal. FCCCD is much better than
the regular CCD. The difference between the performance of regular CCD and FCCCD is due to the location
of the axial points. For a regular CCD, the axial points
are set at 2.8284 on the axis for a rotatable orthogonal
CCD, which will induce large errors with the presence of
bias errors. It is seen that 46 656 random test points provide close estimation to the TRMSE, while the estimation
from structured 46 656 points as 66 grid is less close to the
TRMSE. This reflects the fact that, of the 46 656 points,
42 560 (66 − 46 ) points are on the boundary of the domain. If we were using these points to estimate the RMSE
by integration, at least trapezoidal integration should be
used. It is seen in Table 7 that the use of the trapezoidal
rule improves the RMS error estimate substantially by
drastically reducing the weight of the boundary points
compared to the interior points. This demonstrates that,
when bias errors dominate random errors, designs of experiments and error estimation that arrange points inside
the domain generally provide better results than those
putting points on the boundary of the domain.
Numerical response is usually noisy, thus it is necessary to investigate the performance of minimum-bias
CCD for response with noise. We compare the four designs for the quartic polynomial with all coefficients one
and normally distributed noise that has a mean value of
0 and standard deviation of 10 percent of the average of
the test function over the unit cube, Normal (0, 0.5867).
The accuracy of the four response surface approximations
is compared in Table 8. It is seen that the accuracy of the
response surface approximations based on minimum-bias
CCD with or without weights is much better than that of
the response surface approximations based on CCD and
FCCCD. FCCCD is still better than the regular CCD.
It is seen from Table 8 that a smaller L2 Norm indicates better RS accuracy. But, if the L2 Norm is relatively small, such as those of minimum-bias CCD with
and without the use of weights, the difference in the accuracy of the RS is very small, which means that reducing
the L2 Norm beyond a certain level does not improve RS
accuracy significantly. For the cases where the L2 Norm is
large, such as in FCCCD and CCD, the errors of RS based
on the two designs are very large.
Table 7 Comparison of statistics of quadratic response surfaces for a six-variable full quartic polynomial with all coefficients
equal to one
Error Statistics
Minimum-bias CCD
with weights
Minimum-bias CCD
without weights
Regular CCD
FCCCD
0.9263
7.7445
1.8238
1.8557
3.6842
2.2248
0.9482
6.2581
1.8329
1.8497
4.3216
2.4543
0.8667
34.6623
32.5793
32.5255
25.9872
31.4441
0.9475
23.7533
7.4293
7.4718
7.5856
7.3776
Rsquare Adj.
Mean of response∗
TRMSE (exact)
RMSE (46 656 random points)
RMSE (46 656 structured points)
RMSE (46 656 trapezoidal integration)
∗
The exact mean of a full quartic polynomial with all coefficients equal to one is 88/15 = 5.8667.
Table 8 Comparisons of statistics of quadratic response surfaces based on full quartic test function with all coefficients one and
10 percent noise
Error Statistics
Rsquare Adj.
Mean of response
TRMSE (exact)
L2 Norm of residual of (23)
Minimum-bias CCD
with weights
Minimum-bias CCD
without weights
Regular CCD
FCCCD
0.9341
5.7420
1.8716
0.02109
0.9492
6.2554
1.9317
6.514
0.8665
34.5295
32.5732
132.3
0.9470
23.7137
7.4208
1097
239
7
Beam optimization example
Response surface approximations based on engineering
simulation are commonly used for optimization in engineering applications. In order to demonstrate the effect of
approximation accuracy based on various CCD designs,
an example of a six-variable beam optimization problem
is solved here.
7.1
Problem description
The beam is modelled by three segments and is subjected
to a tip load as shown in Fig. 2. The optimization problem is to minimize the weight of the beam, subject to
constraints on maximum bending stress, maximum tip
displacement and geometry. The weight is minimized by
changing the height and width of each segment, resulting in six design variables. The material properties and
problem parameters are shown in Fig. 2.
VisualDOC (Vanderplaats, 2003) was used to create
the approximations and to perform the optimization. The
geometric constraints are fairly simple and are not approximated. Instead, they are represented with linked relationships in VisualDOC as hi − 20bi ≤ 0. The volume
of the beam is used as the objective function. The tip
displacement and maximum bending stress in each component is constrained as y − 2.5 cm ≤ 0 and σi − σ̄ ≤ 0,
respectively. Approximations for the objective function
(volume), tip displacement and maximum bending stress
component in each segment were created, for the minimum bias CCD (with and without weights) and the FCCCD designs. All designs used 77 data points to fit the 28
polynomial coefficients.
Fig. 2 The cantilevered beam
Table 9 Optimization results
b1 (cm)
b2 (cm)
b3 (cm)
h1 (cm)
h2 (cm)
h3 (cm)
Objective (cm3 )
Tip disp (cm)
Sigma 1 (N/cm2 )
Sigma 2 (N/cm2 )
Sigma 3 (N/cm2 )
∗
∗∗
Exact
FCCCD
MB_CCD
with weights∗
MB_CCD
without weights∗∗
2.9920
2.6138
2.0746
59.8408
52.2758
41.4913
66 960.18
2.2069
14 000.00
13 999.99
14 000.00
2.8613
2.5859
2.1222
57.2260
51.7184
42.4434
64 592.49
2.1933
13 997.83
14 020.06
14 022.19
3.0001
2.5971
2.0855
59.9400
51.9415
41.7101
66 966.93
2.1901
14 005.48
14 000.36
13 959.24
3.0103
2.5961
2.0819
59.9400
51.9222
41.6381
66 986.30
2.1968
14 025.74
14 017.91
13 997.92
Minimum-bias CCD with weights from Table 5
Minimum-bias CCD without weights from Table 6
240
Table 10 Exact values of the objective, the displacement and the stresses at optima in Table 9
Objective (cm3 )
Tip disp (cm)
Sigma 1 (N/cm2 )
Sigma 2 (N/cm2 )
Sigma 3 (N/cm2 )
Exact
FCCCD
MB_CCD
with weights
MB_CCD
without weights
66 960.18
2.2069
14 000.00
13 999.99
14 000.00
64 592.50
2.2016
16 008.16
14 457.53
13 078.47
66 966.93
2.2277
13 916.34
14 272.07
13 780.89
66 986.3
2.2352
13 869.23
14 287.97
13 852.51
7.2
Results
The results without and with approximation are shown in
Table 9. In order to obtain baseline results for comparing
the results obtained when using the approximations, the
problem was first directly optimized without using any
approximations.
Table 10 provides exact values of the objective, the
displacement and the stresses at the optima found by
each response surface approximation as a comparison of
the accuracy of the response surface approximations. Recall that Table 9 provides the optimum function values
obtained from the approximations. As a comparison of
the accuracy of the response surface approximations,
Table 10 provides the exact function evaluations at the
optimum points reported in Table 9. Comparing Tables 9
and 10, it is clear that the volume is exactly approximated
in each case. This is to be expected since the volume is
a quadratic function of the design variables. However, the
stress constraints are not exactly approximated, with the
FCCCD approach resulting in the least accurate stress
approximations. For example, the FCCCD approximations led to an optimum that had a stress violation of
12.6% for Sigma 1, resulting in a lower volume value as
compared to the other approaches. In contrast the two
minimum-bias designs resulted in much more accurate
stress approximations, with a maximum stress violation
of about 2% at the optimum. It is seen that the effect of
using weights is insignificant.
8
Concluding remarks
Minimum-bias-based design of experiments can effectively reduce the bias errors in response surface approximations. Minimum-bias design is traditionally performed with equal weights assigned to each design point.
A new formulation of minimum-bias design, based on
Gauss quadrature that utilizes weights as in numerical
quadrature, is presented. Based on the tabulated values
of the Gauss quadrature integration points and associated weights, the new formulation appears to provide the
designer with a more accurate, closed-form minimumbias design for a wider range of approximation domains
with much lower computational intensity than traditional
minimum-bias design. In all cases considered where the
bias or modelling error dominates, the response surface
approximations obtained from the minimum-bias-based
designs are more accurate than similar response models
based on the D-optimal criterion.
Central composite design is a popular minimumvariance design of experiment and requires smaller numbers of design points than the Gaussian-quadrature
minimum-bias design, thus is more affordable for computationally expensive problems. Minimum-bias central
composite design with and without the use of weights
in low dimensions (e.g. three to six) were developed to
provide the minimum-bias equivalent of the central composite design. Minimum-bias central composite designs
with and without the use of weights outperformed the
traditional central composite design in terms of the accuracy of response surface approximation. It is demonstrated that, when bias errors dominate random errors,
designs of experiments and error estimation that arrange
points inside the domain generally provide better results than those putting points on the boundary of the
domain.
Weights associated with points in the design of experiments have two contributions in response surface approximations. One is to be used in regression analysis to solve
for the unknown coefficients in the response surface approximation. The second contribution of weights is that
the weights change the location of the design of experiment points. The results of the tubular buckling and polynomial examples indicate that using weights of Gaussquadrature minimum-bias design in the regression analysis contributes little toward improving the accuracy of the
constructed response surface approximation. The beam
design and polynomial examples demonstrated that the
use of weights in the construction of minimum-bias central composite design has minimal influence on the accuracy of the resulted response surface approximation.
For response surface approximation in high dimensions (> 10), the number of points required by Gauss
quadrature or central composite design can be very high.
Minimum-bias design based on the principle outlined in
this paper can be developed for currently available design
of experiments for high dimensions, such as fractional factorial design, orthogonal arrays or Latin hypercube sampling (Qu et al. 2003).
241
Acknowledgements This work was supported in part by Grant
NAG-1-2177 from NASA Langley Research Center and NSF
Grant DM5-9979711.
References
Balabanov, V.; Kaufman, M.; Knill, D.L.; Haim, D.; Golovidov, O.; Giunta, A.A.; Grossman, B.; Mason, W.H.; Watson, L.T.; Haftka, R.T. 1996: Dependence of Optimal Structural Weight on Aerodynamic Shape for a High Speed Civil
Transport. Proc. of 6 th Symp. on Multidisc. Anal. Optim.
(held in Bellevue, WA, USA), 599–612, AIAA-1996-4046
Box, G.E.P.; Wilson, K.B. 1951: On the Experimental Attainment of Optimum Conditions. J Roy Stat Soc B, 1–45
Box, G.E.P.; Behnken, D.W. 1960: Some New Three-Level
Designs for the Study of Quantitative Variables. Technometrics 30, 1–40
Box, G.E.P.; Draper, N.R. 1987: Empirical Model-Building
and Response Surfaces. 437–442, New York: Wiley
Burden, R.L.; Faires, J.D. 1989: Numerical Analysis. 198–203,
4th edn. Boston: PWS-Kent
Cook, R.D.; Malkus, D.S.; Plesha, M.E. 1989: Concepts and
Applications of Finite Element Analysis. New York: Wiley
Giunta, A.A.; Dudley, J.M.; Narducci, R.; Grossman, B.;
Haftka, R.T.; Mason, W.H.; Watson, L.T. 1994: Noisy Aerodynamic Response and Smooth Approximations in HSCT
Design. Proc. of 5 th Symp. on Multidisc. Anal. Optim. (held
in Panama City, Florida, USA), 1117–1128, AIAA-1994-4376
Kaufman, M.; Balabanov, V.; Grossman, B.; Mason, W.H.;
Watson, L.T.; Haftka, R.T. 1996: Multidisciplinary Optimization via Response Surface Techniques. Proc. of 36 th Israel
Conf. on Aero. Sci. (held in Israel), A-57–A-67
Mason, B.H.; Haftka, R.T.; Johnson, E.R.; Farley, G.L. 1998:
Variable Complexity Design of Composite Fuselage Frames by
Response Surface Techniques. Thin Wall Struct 32, 235–261
Papila, M.; Haftka, R.T.; Watson, L. 2004: Bias Error Bounds
for Response Surface Approximations and Min-Max Bias Design. in preparation for AIAA J
Mistree, F.; Patel, B.; Vadde, S. 1994: On Modeling Objectives and Multilevel Decisions in Concurrent Design. Proc. of
20 th ASME Design Automation Conf. (held in Minneapolis,
Minnesota, USA), 151–161.
Strang, G. 1986: Introduction to Applied Mathematics.
137–154, Massachusetts: Wellesley-Cambridge
Stroud, A.D.; Secrest, D. 1966: Gaussian Quadrature Formulas. New Jersey: Prentice-Hall
Vanderplaats, G. 2003: VisualDOC 3.0 User Manual. Colorado: Vanderplaats Research & Development
Venter, G.; Haftka, R.T. 1997: Minimum-Bias Based Experimental Design for Constructing Response Surfaces in Structural Optimization. Proc. of 38th AIAA/ASME/ASCE/AHS/
ASC Struct., Struct. Dyn., Mater. Conf. (Held in Kissimmee,
Florida, USA), 1225–1238, AIAA-1997-1053
Venter, G.; Haftka, R.T.; Chirehdast, M. 1997: Response Surface Approximations for Fatigue Life Prediction. Proc. of
38th AIAA/ASME/ASCE/AHS/ASC Struct., Struct. Dyn.,
Mater. Conf. (Held in Kissimmee, Florida, USA), 1383–1396,
AIAA-1997-1331
Appendix:
Predictive capabilities
The percent average error (%AveErr), the percent root
mean square error (%RMSE) and the percent maximum
error (%MaxErr) calculated using m test points in the
present paper are defined as:
100 %AveErr =
|yi − ŷi|
ȳm i=1
m
m
100 1
%RMSE =
(yi − ŷi )2
ȳ
m i=1
%MaxErr =
100
Max [|yi − ŷi|]
i
ȳ
(A.1)
(A.2)
(A.3)
where:
1 |yi |
m i=1
m
ȳ =
(A.4)
The G-Efficiency (Geff ) is defined as (Myers and Montgomery 1995, p. 367)
Geff =
p
Max [v(xi )]
(A.5)
i
Myers, R.H.; Montgomery, D.C. 1995: Response Surface
Methodology-Process and Product Optimization Using Designed Experiments. New York: Wiley
where p denotes the number of parameters (monomials)
in the response model and v(xi ) denotes the scaled prediction variance. The scaled prediction variance, defined
as
Nakamura, S. 1991: Applied Numerical Methods with Software. 123–130, New Jersey: Prentice-Hall
v(xi ) =
Qu, X.; Venkataraman, S.; Haftka, R.T.; Johnson, T.F. 2003:
Deterministic and Reliability-based Optimization of Composite Laminates for Cryogenic Environments. AIAA J 41,
2029–2036
N Var(ŷi )
σ2
(A.6)
satisfies the property
Max [v(xi )] ≥ p
i
(A.7)
242
Also, σ2 denotes the variance of the error and Var(ŷi ) denotes the prediction variance of data point i. For the case
of unequal weights, the prediction variance may be derived from the formulation of the response model
−1 T
ŷi = z(xi ) T b b = ZT W̄Z
Z W̄y
(A.8)
as follows
coefficients b is the diagonal elements of the covariance
matrix, which is defined as:
C = Var(b) = (ZT W̄Z)−1 ZT W̄Var(y)W̄Z(ZT W̄Z)−1 =
−1
(ZT W̄Z)−1 ZT W̄Iσ2 W̄Z ZT W̄Z
=
T
−1 T
T
−1
2
σ Z W̄Z
Z W̄W̄Z Z W̄Z
(A.9)
where the variance σ2 of the model is constant and is approximated by (Myers and Montgomery, 1995, p. 673):
Var(ŷi ) = xTi V ar(b)xi
where:
W̄ = WW
The matrix Var (b) is known as the covariance matrix and
is denoted by C in the present work. The variance of the
m
σ̂2 =
(yi − ŷi)2
i=1
m−p
(A.10)
The prediction variance of data point i is then defined as:
−1 T
−1
Z W̄W̄Z ZT W̄Z
xi
Var(ŷi ) = σ2 xTi ZT W̄Z
(A.11)