Suppose we have a random vector and a convex set
such that
If you are doing things with convexity, then you may wonder whether
This is certainly true if only takes finitely many value in
or
is closed. In the first case, you just verify the definition of convexity and the second case, you may use the strong law of large numbers. But if you draw a picture and think for a while, you might wonder whether these conditions are needed as it looks like no matter what value
takes, it can not go out of
and the average should still belong to
as long as
is convex. In this post, we are going to show that it is indeed the case and we then have a theorem.
Theorem 1 For any convex set
, and for any random vector
such that
its expectation is still in
, i.e,
as long as the mean exists.
Skip the following remark if you don’t know or not familiar with measure theory.
Remark 1 If you are a measure theoretic person, you might wonder whether
should be Borel measurable. The answer is no. The set
needs not to be Borel measurable. To make the point clear, suppose there is an underlying probability
and
is a random variable from this probability space to
where
is the borel sigma-algebra. Then we can either add the condition that the event
or
is understood as
is a measurable event with respect to the completed measure space
and we overload the notation
to mean
. The probability space
is completed by the probability measure
.
To have some preparation for the proof, recall the separating hyperplane theorem of convex set.
Theorem 2 (Separating Hyperplane theorem) Suppose
and
are convex sets in
and
, then there exists a nonzero
such that
for all
.
Also recall the following little facts about convexity.
- Any convex set in
is always an interval.
- Any affine space of
dimension in
is of the form
for some
and
.
We are now ready to prove Theorem 1.
Proof of Theorem 1: We may suppose that has nonempty interior. Since if it is not, we can take the affine plane containing
with smallest dimension. Suppose
is not in
, then by separating hyperplane theorem, there exists a nonzero
such that
Since almost surely, we should have
almost surely. Since , we see that
with probability
. Since intersection of the hyperplane of
and
is still convex, we see that that
only takes value in a convex set in a
dimensional affine space.
Repeat the above argument, we can decrease the dimension until . After a proper translation and rotation, we can say that
takes its value in an interval in
and we want to argue that the mean of
is always in the interval.
This is almost trivial. Suppose the interval is bounded. If the interval is closed, then since taking expectation preserves order, i.e., , we should have its mean in the interval. If the interval is half open and half closed and if the means is not in the interval, then
must be the open end of the interval since expectation preserves order, but this means that
has full measure on the open end which contradicts the assumption that
is in the interval with probability one. The case both open is handled in the same way. If the interval is unbounded one way, then the previous argument still works and if it is just
, then for sure that
. This completes the proof.
Nice!
LikeLike