Download On the use of conditional expectation estimators

Survey
yes no Was this document useful for you?
   Thank you for your participation!

* Your assessment is very important for improving the workof artificial intelligence, which forms the content of this project

Document related concepts
no text concepts found
Transcript
New Developments in Pure and Applied Mathematics
On the use of conditional expectation estimators
Sergio Ortobelli, Tommaso Lando
difference between the two methods, that are actually aimed at
vs. the
different estimates (i.e. the mathematical function
) and therefore are not comparable. In
random variable
order to compare these two different methodologies, we
based
propose a method to estimate the distribution of
on the kernel non-parametric formula proposed by [1] and [2].
(which, for
Then, if we know the real distribution of
instance, can be easily computed in case of normality), then
we can perform a simulation analysis, drawing a bivariate
, and finally investigate which
random sample from
estimated distribution better fits to the true one.
The paper is organized as follows: in Section II we present
the different methodologies and their properties; in Section III
we examine a method to compare the two estimators, with
assumption of normality; in Section IV we briefly illustrate the
financial interpretation and possible application of the
conditional expected value.
Abstract— This paper discusses two different methods to
estimate the conditional expectation. The kernel nonparametric regression method allows to estimate the regression
function, which is a realization of the conditional expectation
. A recent alternative approach consists in estimating
the conditional expectation (intended as a random variable),
based on an appropriate approximation of the σ-algebra
generated by X. In this paper, we propose a new procedure to
estimate the distribution of the conditional expectation based
on the kernel method, so that it is possible to compare the two
approaches by verifying which one better estimates the true
. In particular, if we assume that the
distribution of
is normally distributed, then
two-dimensional variable
can be computed quite easily,
the true distribution of
and the comparison can be performed in terms of goodness-offit tests.
Keywords—Conditional Expectation, Kernel, Non Parametric,
II. METHODS
Regression.
In this section we describe two different procedures to
evaluate the conditional expected value between two random
and
be integrable random
variables. Let
. Let
variables in the probability space
be a random sample of
independent observations from the bi-dimensional variable
. The first procedure is aimed at estimating the
given
, which is a
conditional expectation of
mathematical function of ; the second method yields an
unbiased and consistent estimator of the random variable
.
The kernel non-parametric regression
It is well known that, if we know the form of the function
(e.g. polynomial, exponential, etc.), then
with several
we can estimate the unknown parameters of
methods (e.g. least squares). In particular, if we do not know
, except that it is a continuous and
the general form of
smooth function, then we can approximate it with a nonparametric method, as proposed by [1] and [2]. Thus,
can be estimated by:
I. INTRODUCTION
W
ithin a bivariate probabilistic framework, this paper
discusses different methods to estimate the conditional
expected value. On the one hand, several well known
methods are aimed at estimating the regression function
, which represents a realization of the
. In particular, the kernel nonrandom variable
parametric regression (see [1] and [2]) allows to estimate
as a locally weighted average, based on the
choice of an appropriate kernel function: the method yields
consistent estimators, provided that the kernel functions and
the random variable satisfy some conditions, described in
Section II. On the other hand, an alternative methodology was
recently introduced by [3] for estimating the random variable
: this method has been proved to be consistent without
requiring any regularity assumption. In this paper we stress the
This paper has been supported by the Italian funds ex MURST 60% 2014,
2015 and MIUR PRIN MISURA Project, 2013–2015, and ITALY project
(Italian Talented Young researchers). The research was also supported
through the Czech Science Foundation (GACR) under project 13-13142S and
through SP2013/3, an SGS research project of VSB-TU Ostrava, and
furthermore by the European Regional Development Fund in the
IT4Innovations Centre of Excellence, including the access to the
supercomputing capacity, and the European Social Fund in the framework of
CZ.1.07/2.3.00/20.0296 (to S.O.) and CZ.1.07/2.3.00/30.0016 (to T.L.).
S.O. Author is with University of Bergamo, via dei Caniana, 2, Bergamo,
Italy; and VŠB -TU Ostrava, Sokolská třída 33, Ostrava, Czech republic; email: [email protected].
T.L. Author is with University of Bergamo, via dei Caniana, 2, Bergamo,
Italy; and VŠB -TU Ostrava, Sokolská třída 33, Ostrava, Czech republic; email: [email protected].
ISBN: 978-1-61804-287-3
,
(1)
where
is a density function such that i)
;
; iii)
when
. The
ii)
is denoted by kernel, observe that kernel
function
functions are generally used for estimating probability
densities non-parametrically (see [4]). It was proved in [1] that
is quadratically integrable then
is a consistent
if
. In particular, observe that, if we denote by
estimator for
244
New Developments in Pure and Applied Mathematics
the joint density of
, the denominator of (1)
, while the
converges to the marginal density of
numerator converges to the function
(note that, if
The method, as defined by (3), requires only that is an
integrable random variable. From a practical point of view,
given n i.i.d. observations of , if we know the probability
corresponding to the i-th outcome , we obtain:
.
(5)
Otherwise, we can give uniform weight to each observation,
:
which yields the following consistent estimator of
is continuous, the
function
intended as a regular conditional probability).
has to be
,
The OLP method
We now describe an alternative non-parametric approach
[3] for approximating the conditional expectation, the method
is denoted by “OLP”, which is an acronym of the authors’
the σ-algebra generated by X (that is,
names. Define by
, where
is the
Borel σ-algebra on ). Observe that the regression function is
,
just a “pointwise” realization of the random variable
. The following
which can equivalently be denoted by
rather than
.
methodology is aimed at estimating
can be approximated by a σ-algebra generated by a
, we
suitable partition of . In particular, for any
of
consider the partition
where
is the number of elements of . Therefore, we are
, which in turn is a consistent
always able to estimate
.
estimator of the conditional expected value
III. COMPARISON IN CASE OF NORMALITY
If we assume that and
i.e.
of the random variable
that
are jointly normally distributed,
, we can obtain the distribution
quite easily. Indeed, we know
,
(7)
in
therefore, as
subsets, where b is an integer number greater than 1 and:
•
(6)
, we obtain that
,
.
(8)
•
•
Of course, if we simulate data from
and approximate
with the estimator
defined in (3), we can
finally compare the true and the theoretical (estimated)
distribution by performing a goodness-of-fit test. Differently,
the kernel non-parametric regression method does not allow to
, but only yields a consistent estimator of
estimate
. However, assume that the random variable
is
and moreover
(that is,
independent from
and
): in this case we can estimate
with
.
Starting with the trivial sigma algebra
, we can
obtain a sequence of sigma algebras generated by these
partitions, for different values of k (k=1,…,m,…). For
is the sigma algebra
instance,
,
generated by
s=1,...,b-1 and
. Generally:
,
(2)
and thereby we can also estimate the distribution of
,
. Obviously, the estimate depends
because
on the choice of the kernel function . It is proved that
converges
almost
surely
to
). Moreover, note that also
(
satisfies a weaker convergence property (convergence in
distribution). Indeed, we have that
,
(10)
thus we obtain that
.
Finally, it is possible to compare the two methods by
verifying which one better estimates the distribution of
, future studies will be focused on this issue.
Hence, it is possible to estimate the random variable
by
,
where
(3)
. Indeed, by definition of the
conditional expectation, we can easily verify that
the unique -measurable function such that, for any set
, (that can be seen as a union of disjoint sets, in
we obtain the equality
particular
is
IV. CONDITIONAL EXPECTATION AND FINANCIAL
APPLICATIONS
(4)
It is proved in [3] that
random variable
a.s.
The conditional expectation of a random variable given
another can be especially useful for financial applications. In
particular, we can use conditional expectation estimators
is a consistent estimator of the
, that is,
ISBN: 978-1-61804-287-3
(9)
245
New Developments in Pure and Applied Mathematics
either for ordering the investors choices as suggested by [3] or
to evaluate and exercise those arbitrage opportunities when
applies in the market.
We recall the classical definitions of first and second-orders
stochastic dominance.
First order stochastic dominance: X FSD Y if and only if
(9) yields a consistent estimator of the distribution function of
. In future work it will be possible to compare these
two methodologies by verifying which one better estimates the
, based on simulation analysis and
distribution of
goodness-of-fit tests. Moreover, we recall that these estimators
may have several financial applications such as ordering
investors’ opportunities or identifying arbitrage opportunities.
or, equivalently X FSD Y if and only if
for any increasing function .
Second order stochastic dominance (increasing concave
REFERENCES
[1]
order): X SSD Y if and only if
[2]
or, equivalently X SSD Y if and only if
for any increasing and concave function .
Obviously X FSD Y implies also X SSD Y.
If we assume that X and Y are, for instance, two different
gambles or investments, the financial interpretation of
stochastic orders follows straightforward. Indeed, in this case
X FSD Y means that X is stochastically “larger” than Y, while
X SSD Y indicates a larger expectation of gain and generally
an inferior “risk”. Those investors who prefer X to Y ,
provided that X SSD Y, are generally defined non-satiable risk
averse investors. The following property characterizes the
second order stochastic dominance in terms of conditional
expectation.
[3]
[4]
[5]
[6]
[7]
Super-martingale property. X SSD Y if and only if there exist
two random variables X', Y' defined on the same probability
space that have the same distribution of X and Y such that:
The proof of this property arises from the analysis proposed
by Strassen [5] and is a well-known result of ordering theory
(see also [6]-[7] and the references therein).
Thus, using the empirical evaluation of the conditional
expected value, we can attempt to order the investors choices
as suggested by [3]. On the other hand, using the fundamental
theorem of arbitrage, we know that there exist no arbitrage
opportunities in the market if there exists a risk neutral
martingale measure under which the discounted price process
results a martingale. So, when we assume that the filtration
is the one generated by the price process {Xt}t>0 (assumed
)=
).
to be a Markov process) then we get that
Therefore, this property the contidional expected value
estimator and the fundamental theorem of arbitrage can be
used to estimate the risk neutral measure and the presence of
arbitrage opportunities in the market.
V. CONCLUSION
In this paper, we deal with two methodologies for
estimating the conditional expectation, studying their
properties and analyzing their differences. We also propose a
based on the
procedure to estimate the distribution of
kernel method. We observe that the OLP method proposed by
[3] yields a consistent estimator of the random variable
, while the generalized kernel method, proposed in eq.
ISBN: 978-1-61804-287-3
246
E. A. Nadaraya, “On estimating regression,” Theory of Probability and
its Applications, vol. 9, no. 1, pp. 141-142, 1964.
G. S. Watson, “Smooth regression analysis,” Sankhya, Series A, vol. 26,
no. 4, pp. 359-372, 1964.
S. Ortobelli, F. Petronio, T. Lando, “A portfolio return definition
coherent with the investors preferences,” under revision in IMA-Journal
of Management Mathematics.
V. A. Epanechnikov, “Non-parametric estimation of a multivariate
probability density,” Theory of Probability and its Applications, vol. 14,
no. 1, pp. 153-158, 1965.
V. Strassen, “The existence of probability measures with given
marginals,” Ann. Math. Statist., vol. 36, no. 2, pp. 423-439, 1965.
A. Müller, D. Stoyan, Comparison methods for stochastic models and
risks, New York: John Wiley & Sons, 2002.
M. Shaked, G. Shanthikumar, Stochastic orders and their applications.
New York: Academic Press Inc. Harcourt Brace & Company, 1994.