Proofs Section 1.1 (Initial results to LF-duality)

post by Diffractor · 2020-08-27T07:59:12.512Z


Fair upfront warning: This is not a particularly readable proof section (though much better than Section 2 about belief functions). There's dense notation, logical leaps due to illusion of transparency since I've spent a month getting fluent with these concepts, and a relative lack of editing since it's long. If you really want to read this, I'd suggest PM-ing me to get a link to MIRIxDiscord, where I'd be able to guide you through it and answer questions.

 

Proposition 1: If  then  is a positive functional on .

Proof Sketch: We just check three conditions: linearity, being nonnegative on , and continuity.

Linearity proof. Using  for constants,

So we have verified that  and we have linearity.

Positivity proof: An sa-measure , writeable as  has  uniquely writeable as a pair of finite measures  (all the positive regions) and a  (all the negative regions) by the Jordan Decomposition Theorem, and . So,

The first  by , so the expectation of  is positive and  is negative so taking the expectation of 1 is more negative. The second  is by the condition on how  relates to .
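
For reference, the standard fact being invoked (stated in generic notation, which may differ slightly from the notation used in this sequence): every finite signed measure $m$ decomposes uniquely as

$$m = m^+ - m^-,$$

where $m^+$ and $m^-$ are finite (nonnegative) measures that are mutually singular, so $m^+(1)$ and $m^-(1)$ measure the total amount of positive and negative mass, respectively.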

Continuity proof: Fix a sequence  converging to . Obviously the  part converges, so now we just need to show that  converges to . The metric we have on the space of finite signed measures is the KR-metric, which implies the thing we want. This only works for continuous , not general .
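
For reference, one common way to write the KR (Kantorovich-Rubinstein) metric on finite signed measures, which I'll take as the intended formulation here (conventions differ on whether the sup-norm bound is included):

$$d_{KR}(m,m') = \sup\{\,|m(f) - m'(f)| \;:\; f \text{ is 1-Lipschitz},\ \|f\|_\infty \le 1\,\}.$$

Under this metric, $m_n \to m$ implies $m_n(f) \to m(f)$ for every bounded Lipschitz $f$, which is the kind of convergence the continuity argument leans on, and is also why the statement is restricted to continuous $f$.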

 

Theorem 1: Every positive functional on  can be written as , where , and 

Proof Sketch: The first part is showing that it's impossible to have a positive functional where the  term doesn't matter, without the positive functional being the one that maps everything to 0. The second part of the proof is recovering our  by applying the positive functional to Dirac-delta measures , to see what the function must be on point .

Part 1:  Let's say  isn't 0, ie there's some nonzero  pair where , and yet  (which, by linearity, means that  for all ). We'll show that this situation is impossible.

Then,  by our starting assumption, and Jordan decomposition of , along with linearity of positive functionals. Now,  because positive functionals are linear, and everything in that above equation is an sa-measure (flipping a negative measure makes a positive measure, which doesn't impose restrictions on the  term except that it be ).  And so, by nonnegativity of positive functionals on sa-measures, . Using this, we get

Another use of linearity was invoked for the first  in the second line, and then the second  made use of our assumption that  for all .

At this point, we have derived that . Both of these are positive measures. So, there exists some positive measure  where .

Now, observe that, for all 

Let b be sufficiently huge to make  into an sa-measure. Also, since , which is impossible because positive functionals are nonnegative on all sa-measures. Contradiction. Due to the contradiction, if there's a nonzero positive functional, it must assign , so let  be our  term.

Proof part 2: Let's try to extract our f. Let  This is just recovering the value of the hypothesized  on  by feeding our positive functional the measure  that assigns 1 value to  and nothing else, and scaling. Now, we just have to verify that this  is continuous and in .

For continuity, let  limit to . By the KR-metric we're using,  limits to . By continuity of  limits to . Therefore,  limits to  and we have continuity.

For a lower bound, , because  is a ratio of two nonnegative numbers, and the denominator isn't 0.

Now we just have to show that . For contradiction, assume there's an  where . Then , so , and in particular, .

But then, , so 

However,  is an sa-measure, because , and must have nonnegative value, so we get a contradiction. Therefore, .

To wrap up, we can go:

And , and , so we're done.

 

Lemma 1: Compactness Lemma: Fixing some nonnegative constants  and , the set of sa-measures where , is compact. Further, if a set lacks an upper bound on  or on , it's not compact.

Proof Sketch: We fix an arbitrary sequence of sa-measures, and then use the fact that closed intervals are compact-complete and the space  is compact-complete to isolate a suitable convergent subsequence. Since all sequences have a limit point, the set is compact. Then, we go in the other direction, and get a sequence with no limit points assuming either a lack of upper bounds on , or a lack of upper bounds on .

Proof: Fix some arbitrary sequence  wandering about within this space, which breaks down into , and then, since all measures are just a probability distribution scaled by the constant , it further breaks down into . Since  must be bounded in .

Now, what we can do is extract a subsequence where , , and  all converge, by Tychonoff's Theorem (finite product, no axiom of choice required). Our three number sequences are all confined to a bounded interval, and our two probability sequences are wandering around within , which is a compact complete metric space if  is. The limit of this subsequence is a limit point of the original sequence, since all its components are arbitrarily close to the components that make up  for large enough n in our subsequence.

The limiting value of  and  both obey their respective bounds, and the cone of sa-measures is closed, so the limit point is an sa-measure and respects the bounds too. Therefore the set is compact, because all sequences of points in it have a limit point.

In the other direction, assume a set  has unbounded  values. Then we can fix a sequence  where  increases without bound, so the sa-measures can't converge. The same applies to all subsequences, so there's no limit point, so  isn't compact.

Now, assume a set  has bounded  values, call the least upper bound , but the value of  is unbounded. Fix a sequence  where  is unbounded above. Assume a convergent subsequence exists. Since  must be bounded in . Then because , and the latter quantity is finite,  must be unbounded above. However, in order for the  to limit to some , which results in a contradiction. Therefore, said convergent subsequence doesn't exist, and  is not compact.

Put together, we have a necessary-and-sufficient condition for a closed subset of  to be compact. There must be an upper bound on  and , respectively.

 

Lemma 2: The upper completion of a closed set of sa-measures is closed.

Proof sketch: We'll take a convergent sequence  in the upper completion of  that limits to , and show that, in order for it to converge, the same sorts of bounds as the Compactness Lemma uses must apply. Then, breaking down  into , where , and  is an sa-measure, we'll transfer these Compactness-Lemma-enabling bounds to the sequences  and , to get that they're both wandering around in a compact set. Then, we just take a convergent subsequence of both, add the two limit points together, and get our limit point , witnessing that it's in the upper completion of .

Proof: Let  limit to some . A convergent sequence (plus its one limit point) is a compact set of points, so, by the Compactness Lemma, there must be a  and  that are upper bounds on the  and  values, respectively.

Now, for all n, break down  as , where , and  is an sa-measure.

Because , we can bound the  and  quantities by . This transfers into a  lower bound on  and , respectively.

Now, we can go:

Using worst-case values for  and , we get:

So, we have upper bounds on  and  of , respectively.

Due to the sequences  and  respecting bounds on  and  ( and  respectively), and wandering around within the closed sets  and  respectively, we can use the Compactness Lemma and Tychonoff's theorem (finite product, no axiom of choice needed) to go "hey, there's a subsequence where both  and  converge, call the limit points  and . Since  and  are closed, , and ."

Now, does ? Well, for any , there's some really large n where , and . Then, we can go:

So, regardless of , so . So, we've written  as a sum of an sa-measure in  and an sa-measure, certifying that , so  is closed.

 

Proposition 2: For closed convex nonempty ,

Proof sketch: Show both subset inclusion directions. One is very easy, then we assume the second direction is false, and invoke the Hahn-Banach theorem to separate a point in the latter set from the former set. Then we show that the separating functional is a positive functional, so we have a positive functional where the additional point underperforms everything in , which is impossible by the definition of the latter set.

Easy direction: We will show that 

This is because a , can be written as . Let  be our  of interest. Then, it is indeed true that for all 

Hard direction: Assume by contradiction that

Then there's some  where , and  is the upper completion of a closed set, so by Lemma 2, it's closed, and since it's the Minkowski sum of convex sets, it's convex.

Now, we can use the variant of the Hahn-Banach theorem from the Wikipedia article on "Hahn-Banach theorem", in the "separation of a closed and compact set" section. Our single point  is compact, convex, nonempty, and disjoint from the closed convex set . Banach spaces are locally convex, so we can invoke Hahn-Banach separation.
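
For reference, the separation statement being used (standard form, in generic notation): if $K$ and $C$ are disjoint nonempty convex subsets of a locally convex topological vector space, with $K$ compact and $C$ closed, then there is a continuous linear functional $\phi$ and constants $c_1 < c_2$ such that

$$\phi(x) \le c_1 < c_2 \le \phi(y) \quad \text{for all } x \in K,\ y \in C$$

(possibly after replacing $\phi$ with $-\phi$ to get the inequality in this orientation).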

Therefore, there's some continuous linear functional  s.t. 

We will show that this linear functional is actually a positive functional!

Assume there's some sa-measure  where . Then we can pick a random , and consider , where  is extremely large.  lies in , but it would also produce an extremely negative value for φ, which undershoots , which is impossible. So  is a positive functional.

However, , so . But also,  fulfills the condition , because of the set it came from. So, there must exist some  where . But, we have a contradiction, because .

So, there cannot be any point in  that isn't in . This establishes equality.

 

Lemma 3: For any closed set  and point , the set  is nonempty and compact.

Proof: It's easy to verify nonemptiness, because  is in the set. Also, it's closed because it's the intersection of two closed sets.  was assumed closed, and the other part is the Minkowski sum of  and , which is closed if  is, because it's just a shift of  (via a single point).  is closed because it's -1 times a closed set.

We will establish a bound on the  and  values of anything in the set, which lets us invoke the Compactness Lemma to show compactness, because it's a closed subset of a compact set.

Note that if , then , so . Rewrite this as 

Because , we can bound  and  by . This transfers into a  lower bound on  and . Now, we can go:

Using worst-case values for  and , we get:

So, we have an upper bound of  on , and an upper bound of  on . Further,  was arbitrary in , so we have our bounds. This lets us invoke the Compactness Lemma, and conclude that said closed set is compact.

 

Lemma 4: If  is a partial order on  where  iff there's some sa-measure  where , then

Proof: 

Also, 

Also, 

Putting all this together, we get

And we're halfway there. Now for the second half.

Also, 

Putting this together, we get

And the result has been proved.

 

Theorem 2: Given a nonempty closed set , the set of minimal points  is nonempty and all points in  are above a minimal point.

Proof sketch: First, we establish a partial order that's closely tied to the ordering on , but flipped around, so minimal points in  are maximal elements. We show that it is indeed a partial order, letting us leverage Lemma 4 to translate between the partial order and the set . Then, we show that every chain in the partial order has an upper bound via Lemma 3 and compactness arguments, letting us invoke Zorn's lemma to show that everything in the partial order is below a maximal element. Then, we just do one last translation to show that minimal points in  perfectly correspond to maximal elements in our partial order.

Proof: First, impose a partial order on , where  iff there's some sa-measure  where . Notice that this flips the order. If an sa-measure is "below" another sa-measure in the sa-measure addition sense, it's above that sa-measure in this ordering. So a minimal point in  would be maximal in the partial order. We will show that it's indeed a partial order.

Reflexivity is immediate. , so .

For transitivity, assume . Then there's some  and  s.t. , and . Putting these together, we get , and adding sa-measures gets you an sa-measure, so .

For antisymmetry, assume  and . Then , and . By substitution, , so . For all positive functionals, , and since positive functionals are always nonnegative on sa-measures, the only way this can happen is if  and  are 0, showing that .

Anyways, since we've shown that it's a partial order, all we now have to do is show that every chain has an upper bound in order to invoke Zorn's lemma to show that every point in  lies below some maximal element.
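
For reference, the exact form of Zorn's lemma being used (standard, in generic notation): if every chain in a poset $(P, \preceq)$ has an upper bound in $P$, then $P$ has maximal elements, and moreover every $x \in P$ lies below some maximal element, since the sub-poset

$$\{y \in P : y \succeq x\}$$

inherits the chain condition (any upper bound of a chain through it is itself $\succeq x$, so it stays in the sub-poset).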

Fix some ordinal-indexed chain , and associate each of them with the set , which is compact by Lemma 3 and always contains .

The collection of  also has the finite intersection property, because, fixing finitely many of them, we can consider a maximal , and  is in every associated set by:

Case 1: Some other  equals , so  and .

Case 2: , and by Lemma 4, .

Anyways, since all the  are compact, and have the finite intersection property, we can intersect them all and get a nonempty set containing some point, which lies in , because all the sets we intersected were subsets of . Also, because  for all  in our chain, then if , Lemma 4 lets us get , and if , then . Thus,  is an upper bound for our chain.
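
For reference, the compactness fact being used (standard): if a family of compact subsets of a Hausdorff space has the finite intersection property (every finite subfamily has nonempty intersection), then the whole family has nonempty intersection,

$$\bigcap_\alpha K_\alpha \neq \emptyset.$$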

By Zorn's Lemma, because every chain has an upper bound, there are maximal elements in , and every point in  has a maximal element above it.

To finish up, use Lemma 4 to get: 

 

Proposition 3: Given a , and a  that is nonempty closed, 

Direction 1: since  is a subset of , we get one direction easily, that

Direction 2: Take a . By Theorem 2, there is a  s.t. . Applying our positive functional  (by Proposition 1), we get that . Because every point in  has a point in  which scores as low or lower according to the positive functional,

And this gives us our desired equality.

 

Proposition 4: Given a nonempty closed convex  and 

Proof: First, we'll show . We'll use the characterization in terms of the partial order  we used for the Zorn's Lemma proof of Theorem 2. If a point  is in , then it can be written as , so . Since all points added in  lie below a preexisting point in  (according to the partial order from Theorem 2) the set of maximals (ie, set of minimal points) is completely unchanged when we add all the new points to the partial order via upper completion, so .

For the second part, one direction is immediate. , so . For the reverse direction, take a point . It can be decomposed as , and then by Theorem 2,  can be decomposed as , so , so it lies in , and we're done.

 

Theorem 3: If the nonempty closed convex sets  and  have , then there is some  where 

Proof sketch: We show that upper completion is idempotent, and then use that to show that the upper completions of A and B are different. Then, we can use Hahn-Banach to separate a point of  from  (or vice-versa), and show that the separating functional is a positive functional. Finally, we use Theorem 1 to translate from a separating positive functional to different expectation values of some 

Proof: Phase 1 is showing that upper completion is idempotent. . One direction of this is easy, . In the other direction, let . Then we can decompose  into , where , and decompose that into  where , so  and .

Now for phase 2, we'll show that the minimal points of one set aren't in the upper completion of the other set. Assume, for contradiction, that this is false, so  and . Then, by idempotence, Proposition 4, and our subset assumption,

Swapping the A and B, the same argument holds, so , so .

Now, using this and Proposition 4, .

But wait, we have a contradiction: we said that the minimal points of  and  weren't the same! Therefore, either , or vice-versa. Without loss of generality, assume that .

Now for phase 3, Hahn-Banach separation to get a positive functional with different inf values. Take a point  in  that lies outside . Now, use the Hahn-Banach separation of  and  used in the proof of Proposition 2, to get a linear functional  (which can be demonstrated to be a positive functional by the same argument as the proof of Proposition 2) where: . Thus, , so 

Said positive functional can't be 0, otherwise both sides would be 0. Thus, by Theorem 1,  where , and . Swapping this out, we get:

and then this is . So, we have crafted our , which distinguishes the two sets, and we're done.

 

Corollary 1: If two nonempty closed convex upper-complete sets  and  are different, then there is some  where 

Proof: Either , in which case we can apply Theorem 3 to separate them, or their sets of minimal points are the same. In that case, by Proposition 4 and upper completion,  and we have a contradiction because the two sets are different.

 

Theorem 4: If  is an infradistribution/bounded infradistribution, then  is concave in , monotone, uniformly continuous/Lipschitz, , and if 

Proof sketch:  is trivial, as is uniform continuity from the weak bounded-minimal condition. For concavity and monotonicity, it's just some inequality shuffling, and for  if , we use upper completion to have its worst-case value be arbitrarily negative. Lipschitzness is much more difficult, and comprises the bulk of the proof. We get a duality between minimal points and hyperplanes in , show that all the hyperplanes we got from minimal points have the same Lipschitz constant upper bound, and then show that the chunk of space below the graph of  itself is the same as the chunk of space below all the hyperplanes we got from minimal points. Thus,  has the same (or lesser) Lipschitz constant as all the hyperplanes chopping out stuff above the graph of .

Proof: For normalization,  and  by normalization for . Getting the uniform continuity condition from the weak-bounded-minimal condition on an infradistribution  is also trivial, because the condition just says  is uniformly continuous, and that's just  itself.

Let's show that  is concave over , first. We're shooting for . To show this,

And concavity has been proved.
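
As a sketch of the inequality pattern here, under the assumption (mine) that the expectation takes the usual infimum form $E_H(f) = \inf_{(m,b) \in H}(m(f) + b)$: for $p \in [0,1]$,

$$\inf_{(m,b) \in H}\bigl(m(pf + (1-p)f') + b\bigr) = \inf_{(m,b) \in H}\bigl(p(m(f) + b) + (1-p)(m(f') + b)\bigr) \ge p\inf_{(m,b) \in H}(m(f) + b) + (1-p)\inf_{(m,b) \in H}(m(f') + b),$$

because an infimum of a sum is at least the sum of the infima taken separately. That is exactly the concavity inequality $E_H(pf + (1-p)f') \ge p\,E_H(f) + (1-p)\,E_H(f')$.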

Now for monotonicity. By Proposition 3 and Proposition 1,

Now, let's say . Then:

And we're done. The critical inequality in the middle came from all minimal points in an infradistribution having no negative component by positive-minimals, so swapping out a function for a greater function produces an increase in value.

Time for . Let's say there exists an  s.t. . We can take an arbitrary sa-measure , and consider , where  is the point measure that's 1 on , and  is extremely huge. The latter part is an sa-measure. But then, . Since , and  is extremely huge, this is extremely negative. So, since there are sa-measures that make the function as negative as we wish in  by upper-completeness, . A very similar argument can be done if there's an  where : we just add in  to force arbitrarily negative values.

Now for Lipschitzness, which is by far the worst of all. A minimal point  induces an affine function  (kinda like a hyperplane) of the form . Regardless of , as long as it came from a minimal point in  for functions with range in , because

Ok, so if a point is on-or-below the graph of  over , then it's on-or-below the graph of  for all .

What about the other direction? Is it possible for a point  to be strictly above the graph of  and yet  all the graphs of ? Well, no. Invoking Proposition 3,

So, there exists a minimal point  where , so  lies above the graph of .

Putting these two parts together, 's hypograph over  is the same as the intersection of the hypographs of all these . If we can then show all the  have a Lipschitz constant bounded above by some constant, then we get that  itself is Lipschitz with the same constant.
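
For reference, the standard fact driving this step: a pointwise infimum of $\lambda$-Lipschitz functions is $\lambda$-Lipschitz wherever it is finite. If $h = \inf_i \phi_i$ with every $\phi_i$ $\lambda$-Lipschitz, then for any $f, g$ and any $i$,

$$h(f) \le \phi_i(f) \le \phi_i(g) + \lambda\, d(f,g),$$

and taking the infimum over $i$ on the right gives $h(f) \le h(g) + \lambda\, d(f,g)$; the symmetric argument with $f$ and $g$ swapped finishes the Lipschitz bound.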

First, a minimal  must have  having no negative parts, so it can be written as , and by bounded-minimals (since we have a bounded infradistribution), . Now,

So, we get that: 

Note that  is our distance metric between functions in . This establishes that regardless of which minimal point we picked,  is Lipschitz with Lipschitz constant , and since , then  itself has the same bound on its Lipschitz constant.

 

Lemma 5: 

Proof sketch: We'll work in the Banach space  of  measurable functions w.r.t the absolute value of the signed measure . Then, we consider the discontinuous (but ) function that's 1 everywhere where  is negative. Continuous functions are dense in  measurable functions, so we can fix a sequence of continuous functions limiting to said indicator function. Then we just have to check that  is a bounded linear functional, and we get that there's a sequence of continuous functions  where  limits to the measure of the indicator function that's 1 where everything is negative. Which is the same as the measure of the "always 1" function, but only on the negative parts, and we're done.

Consider the Banach space  of measurable functions w.r.t. the absolute value of the signed measure , ie, , which is a measure. It has a norm given by . To begin with, we can consider the  indicator function  that's 1 where the measure is negative. Note that

Because continuous functions are dense in , we can fix a sequence of continuous functions  limiting to . Then, just clip those continuous functions to , making a continuous function . They'll get closer to  that way, so the sequence  of continuous functions  limits to  too.

We'll take a detour and show that  is a bounded linear functional , with a Lipschitz constant of 1 or less.

First, , trivially, establishing linearity.

As for the boundedness, if , then , so:

So, . An  having a norm of 1 or less gets mapped to a number with a norm of 1 or less, so the Lipschitz constant of  is 1 or less. This implies continuity.
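
Spelled out, the boundedness computation is the standard one: for $f \in L^1(|m|)$,

$$\Bigl|\int f\, dm\Bigr| \le \int |f|\, d|m| = \|f\|_{L^1(|m|)},$$

so the functional $f \mapsto \int f\, dm$ has operator norm at most 1 on $L^1(|m|)$, hence it is continuous.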

Now that we have all requisite components, fix some . There's some n where, for all greater n, . Mapping them through , due to having a Lipschitz constant of 1 or less, then means that  because the value of 1-but-only-on-negative-parts is as-or-more negative than  on the measure, due to  being bounded in . Summarizing,  for all n beyond a certain point, so, for all n beyond a certain point, 

So we have a sequence of functions in  where  limits to , and our signed measure was arbitrary. Therefore, we have our result that .

 

Theorem 5: If  is a function  that is concave, monotone, uniformly-continuous/Lipschitz, , and , then it specifies an infradistribution/bounded infradistribution by: , where  is the function given by , and  is the convex conjugate of . Also, going from an infradistribution to an  and back recovers exactly the infradistribution, and going from an  to an infradistribution and back recovers exactly .

Proof sketch: This is an extremely long one. Phases 1 and 2 are showing isomorphism. One direction is reshuffling the definition of  until we get the definition of the set built from  via convex conjugate, showing that going from  to  and back recovers your original set. In the other direction, we show that expectations w.r.t. the set we built from  match up with  exactly.

Phase 3 is cleanup of the easy conditions. Nonemptiness is pretty easy to show, the induced set being a set of sa-measures is harder to show and requires moderately fancier arguments, and closure and convexity require looking at basic properties of functions and the convex conjugate. Upper completeness takes some equation shuffling to show but isn't too bad. The weak-minimal bound property is immediate, and normalization is fairly easy. 

That just leaves the positive-minimal property and the bounded-minimal properties, respectively, which are nightmares. A lesser nightmare and a greater nightmare. For phase 4 to lay the groundwork for these, we establish an isomorphism between points in H and hyperplanes which lie above the graph of h, as well as a way of certifying that a point in H isn't minimal by what its hyperplane does.

Phase 5 is for showing positive-minimals: we can tell whether a hyperplane corresponds to an a-measure, and given any hyperplane above the graph of , construct a lower one that corresponds to a lower point in  that does correspond to an a-measure.

Phase 6 is for bounded-minimals: we take a hyperplane that may correspond to a minimal point, but which is too steep in certain directions. Then, we make an open set that fulfills two roles: if you enter it, either you're too steep, or you overshoot the hyperplane of interest that you're trying to undershoot. Some fancy equation crunching and one application of Hahn-Banach later, we get a hyperplane that lies above  and doesn't enter the open set we crafted. So, in particular, it undershoots our hyperplane of interest, and isn't too steep. This certifies that our original "too steep" hyperplane didn't actually correspond to a minimal point, so all minimal points must have a bound on their  values, by the duality between hyperplanes above  and points in .

Fix the convention that  or  is assumed to mean ; we'll explicitly specify when  has bounds.

Phase 1: Let's show isomorphism. Our first direction is showing  to  and back is  exactly. By upper completion, and Proposition 2, we can also characterize  as

Using Theorem 1 to express all positive functionals as arising from an , and observing that the a constant in front doesn't change which stuff scores lower than which other stuff (so we might as well characterize everything in terms of ),  can also be expressed as

We can swap out  for , because, from the  argument in Theorem 4,  going outside  means that . And then, our  can further be reexpressed as

Also, , so we can rewrite this as:

and, by the definition of the convex conjugate (sup characterization) and the space of finite signed measures being the dual space of , and  being a functional applied to an element, this is . So, our original set  is identical to the convex-conjugate set, when we go from  to  and back to a set of sa-measures.
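
For reference, the two standard ingredients in this step, in generic notation: by the Riesz representation theorem, the continuous dual of $C(X)$ (for $X$ a compact metric space) is the space of finite signed Borel measures, with the pairing $\langle m, f\rangle = m(f) = \int f\, dm$; and the convex conjugate of a function $g$ on $C(X)$ with respect to that pairing is

$$g^*(m) = \sup_{f}\bigl(m(f) - g(f)\bigr),$$

which is the "sup characterization" referred to above.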

Proof Phase 2: In the reverse direction for isomorphism, assume that  fulfills the conditions. We want to show that , so let's begin.

Given an , we have a natural candidate for minimizing the , just set it equal to . So then we get 

And this is just...  (proof by Wikipedia article, check the inf characterization), and, because  is continuous over , and concave, and  everywhere outside the legit functions then  is continuous over , and convex, and  everywhere outside the legit functions, so in particular,  is convex and lower-semicontinuous and proper, so  by the Fenchel-Moreau Theorem. From that, we get
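
For reference, the Fenchel-Moreau theorem being cited (standard statement): if $g$ is proper, convex, and lower semicontinuous on a locally convex space paired with its dual, then $g$ equals its biconjugate,

$$g(x) = g^{**}(x) = \sup_{x^*}\bigl(\langle x^*, x\rangle - g^*(x^*)\bigr),$$

which is what licenses recovering the original function after two passes through the conjugate.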

and we're done with isomorphism. Now that isomorphism has been established, let's show the relevant conditions hold. Namely, nonemptiness, closure, convexity, upper completion, normality, weak-bounded-minimals (phase 3) and positive-minimals (phase 5) and bounded-minimals (assuming h is Lipschitz) (phase 6) to finish off. The last two will be extremely hard.

Begin phase 3. Weak-bounded-minimals is easy by isomorphism. For our  we constructed, if  wasn't uniformly continuous, then because  equals , we'd get a failure of uniform continuity for , which was assumed.

By the way, the convex conjugate, , can be expressed as (by Wikipedia, sup characterization) . We can further restrict  to functions with range in , because if it was anything else, we'd get . We'll be using  (or the  variant) repeatedly.

For nonemptiness, observe that  is present in  because, fixing an arbitrary ,

This is from our format of the convex conjugate, and  being normalized and monotone, so the highest it can be is  and it attains that value. Therefore, , so  is in the  we constructed.

For showing that our constructed set  lies in , we have that, for a random , it has (by our characterization of )

This is by the lower bound on  being  and unpacking the convex conjugate,  by monotonicity and normalization, a reexpression of , and Lemma 5, respectively.  so it's an sa-measure.

For closure and convexity, by monotonicity of , we have   and  is continuous on , concave, and  everywhere else by assumption, so  is proper, continuous on , convex, and lower-semicontinuous in general because of the  everywhere else, so, by the Wikipedia page on "Closed Convex Function",  is a closed convex function, and then by the Wikipedia page on "Convex Conjugate" in the Properties section,  is convex and closed. From the Wikipedia page on "Closed Convex Function", this means that the epigraph of  is closed, and also the epigraph of a convex function is convex. This takes care of closure and convexity for our 
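
For reference, the general facts doing the work here: any convex conjugate $g^*$ is automatically convex and lower semicontinuous, because it is a pointwise supremum of the continuous affine functions

$$m \mapsto m(f) - g(f),$$

one for each $f$; and a function is lower semicontinuous exactly when its epigraph is closed, while convexity of the function is equivalent to convexity of the epigraph.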

Time for upper-completeness. Assume that  lies in the epigraph. Our task now is to show that  lies in the epigraph. This is equivalent to showing that . Note that , because  is an sa-measure. Let's begin.

This was done by unpacking the convex conjugate, splitting up  into  and , locking two of the components in the sup to be an upper bound (which also gives the sup more flexibility on maximizing the other two components, so this is greater), packing up the convex conjugate, and using that  because 

Normalization of the resulting set is easy. Going from  to a (maybe)-inframeasure  back to  is identity as established earlier, so all we have to do is show that a failure of normalization in a (maybe)-inframeasure makes the resulting  not normalized. Thus, if our  is normalized, and it makes an  that isn't normalized, then going back makes a non-normalized , which contradicts isomorphism. So, assume there's a failure of normalization in . Then , or , so either  or  and we get a failure of normalization for  which is impossible. So  must be normalized.

Begin phase 4. First, continuous affine functionals  that lie above the graph of  perfectly correspond to sa-measures in . This is because the continuous dual space of  is the space of finite signed measures, so we can interpret  as a finite signed measure, and  as the  term. In one direction, given an ,

so every point in  induces a continuous affine functional  whose graph is above .

In the other direction, from earlier, we can describe  as: 

and then, for ,

because . So continuous affine functionals whose graph lies above the graph of  correspond to points in .

So, we have a link between affine functionals that lie above the graph of , and points in . What would a minimal point correspond to? Well, a non-minimal point corresponds to , where the latter component is nonzero. There's some  where  due to the latter component being nonzero, and for all . Using Theorem 1 to translate positive functionals to , this means that the  induced by  lies below the affine functional induced by  over the . So, if there's a different affine functional  s.t. , then  must correspond to a nonminimal point.

Further, we can characterize whether  corresponds to an a-measure or not. For a measure, if you increase your function you're feeding in, you increase the value you get back out, . For a signed measure with some negative component, Lemma 5 says we can find some  that attain negative value, so you can add one of those  to your  and get . So, a  corresponds to an a-measure exactly when it's monotone.
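
A supporting standard fact for the monotonicity criterion: a finite signed Borel measure $m$ on a compact metric space is an actual (nonnegative) measure if and only if

$$\int f\, dm \ge 0 \quad \text{for all continuous } f \ge 0,$$

by regularity of finite Borel measures on metric spaces. This is what lets "monotone in the function argument" translate back and forth with "the measure component has no negative part".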

Phase 5: Proving positive-minimals. With these links in place, this means we just have to take any old point that's an sa-measure in , get a  from it, it'll fulfill certain properties, and use those properties to find a  that lies below  and above  on  and is monotone, certifying that  corresponds to a point below our minimal-point of interest that's still in  but is an a-measure, so we have a contradiction.

To that end, fix a  that corresponds to some point in  that's not an a-measure (in particular, it has a negative component), it lies above the graph of .

Now, translate  to a , where , and  is minimized at some . Since our  corresponds to something that's not an a-measure,  

Let our affine continuous functional  be defined as 

In order to show that  corresponds to an a-measure below  in , we need three things. One is that  is monotone (is an a-measure), two is that it lies below  over  and three is that it lies above . Take note of the fact that , because .

For monotonicity of , it's pretty easy. If , then

and we're done with that part.

For being less than or equal to  over  (we know it's not the same as  because  isn't monotone and  is),

For being  over  it takes a somewhat more sophisticated argument. By Lemma 5, regardless of , there exists a  where . Then, we can go:

The last steps were done via the definition of , and  being monotonic.

So,  for all  and all  getting  for  (because  is  everywhere else)

Thus,  specifies an a-measure ( being monotone) that is below the sa-measure encoded by  (by  over ), yet , so said point is in . This witnesses that there can be no minimal points in  that aren't a-measures. That just leaves getting the slope bound from Lipschitzness, the worst part of this whole proof.

Phase 6: Let  be the Lipschitz constant for . Fix a  that corresponds to a minimal point with . This violates the Lipschitz bound when traveling from 0 to 1, so the Lipschitz bound is violated in some direction. Further, the graph of  touches the graph of  at some point , because if it didn't, you could shift  down further until it did touch, witnessing that the point  came from wasn't minimal (you could sap more from the  term).

Now, if this point is minimal, it should be impossible to craft a  which is  over , and different from . We shall craft such a , witnessing that said point isn't actually minimal. Further, said  won't violate the Lipschitz bound in any direction. Thus, all affine functionals corresponding to minimal points must obey the Lipschitz bound and be monotone, so they're a-measures with .

In order to do this, we shall craft three sets in , and .

Set  is . Pretty much, this set is the hypograph of . It's obviously convex because  is concave, and the hypograph of a concave function is convex. It's closed because  is continuous.

Set  is . This could be thought of as the interior of the epigraph of  restricted to . Undershooting this means you never exceed  over . First, it's open. This is because, due to  being continuous over a compact set , the maximum and minimum are attained, so any  is bounded below 1 and above 0, so we've got a little bit of room to freely wiggle  in any direction. Further, since  is a continuous linear functional on , which is a Banach space, it's a bounded linear functional and has some Lipschitz constant (though it may exceed ), so we have a little bit of room to freely wiggle  as well. So  is open.

Also,  is convex, because a mixture of  and  that are bounded away from 0 and 1 is also bounded away from 0 and 1, and .

Set  is . This could be thought of as an open cone with a point (it's missing that exact point, though) at , that opens straight up, and certifies a failure of the  bound on slope. If an affine function includes the point  in its graph, then if it increases faster than  in any direction, it'll land in this set. It's open because, given a point in it, we can freely wiggle the  and  values around a little bit in any direction, and stay in the set. Now we'll show it's convex. Given an  and  in it, due to  being a Banach space (so it has a norm), we want to check whether .

Observe that (using the defining axioms for a norm)

So,  is convex.
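
A sketch of the norm computation presumably intended at "using the defining axioms for a norm": for any vectors $u, v$ and $p \in [0,1]$,

$$\|pu + (1-p)v\| \le \|pu\| + \|(1-p)v\| = p\|u\| + (1-p)\|v\|,$$

by the triangle inequality and absolute homogeneity. Applied to the displacements of the two mixed points from the cone's tip, this is the inequality that keeps the mixture inside the set.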

Ok, so we've got a convex closed set and two convex opens. Now, consider . The convex hull of an open set is open. We will show that .

Assume this is false, and that they overlap. The point where they overlap can then be written as a convex mixture of points from . However,  and  are both convex, so we can reduce it to a case where we're mixing one point  from  and one point  in . And .

If , then we've just got a single point in . Also, .

This is because  and  has a Lipschitz constant of , so it can't increase as fast as we're demanding as we move from  to , which stays in . So .

If , then we've just got a single point in . Then , so again, .

For the case where  isn't 0 or 1, we need a much more sophisticated argument. Remembering that , and , we will show that  lies strictly above the graph of . Both  and  lie in , so their mix lies in the same set, so we don't have to worry about  being undefined there. Also, remember that  over . Now,

The critical > is by the definition of , and . So, the  term is strictly too high for this point (different than the one we care about) to land on the graph of .

With the aid of this, we will consider "what slope do we have as we travel from  to "? Said slope is

That critical > is by  and the definition of 

So, if we start at  (and  lies in ), we're above the graph of . Then, we travel to , where  by assumption that this point is in , but while doing this, we ascend faster than , the Lipschitz constant for . So, our point of interest  lies above the graph of  and can't lie in , and we have a contradiction.

Putting all this together, . Since  is open, and they're both convex and nonempty, we can invoke Hahn-Banach (first version of the theorem in the "Separation of Sets" section) and conclude they're separated by some continuous linear functional . Said linear functional must increase as  does, because , and  (for some sufficiently large ) lies in , thus in . This means that given any  and  to specify a level, we can find a unique  where .

So, any level set of this continuous linear functional we crafted can also be interpreted as an affine functional. There's a critical value of the level set that achieves the separation, . This is because , but  is in , thus in , for all . So we've uniquely pinned down which affine function  we're going for. Since the graph of  is a hyperplane separating  and  (It may touch the set , just not cut into it, but it doesn't touch ), from looking at the definitions of  and  and , we can conclude:

From the definition of , so  over .

From the definition of  over , and they're both continuous, so we can extend  to  by continuity, so  over .

Also, , so , and this, paired with the ability of  to detect whether an affine function exceeds the  slope bound (as long as the graph of said function goes through ), means that the graph of  not entering  certifies that its Lipschitz constant is  or less. Since φ does enter  due to violating the Lipschitz constant bound, this also certifies that .

Putting it all together, given a  which corresponds to a minimal point and violates the Lipschitz bound, we can find a  below it that's also above , so said minimal point isn't actually minimal.

Therefore, if you were to translate a minimal point in the induced  into an affine function above h, it'd have to A: not violate the Lipschitz bound (otherwise we could undershoot it) and B: be monotone (otherwise we could undershoot it). Being monotone certifies that it's an a-measure, and having a Lipschitz constant of  or less certifies that the  of the a-measure is  or less. We're finally done!

The next proofs are here.
