<?xml version="1.0" encoding="UTF-8"?>
<rss version="2.0"
	xmlns:content="http://purl.org/rss/1.0/modules/content/"
	xmlns:wfw="http://wellformedweb.org/CommentAPI/"
	xmlns:dc="http://purl.org/dc/elements/1.1/"
	xmlns:atom="http://www.w3.org/2005/Atom"
	xmlns:sy="http://purl.org/rss/1.0/modules/syndication/"
	xmlns:slash="http://purl.org/rss/1.0/modules/slash/"
	>

<channel>
	<title>Information Theory b-log</title>
	<atom:link href="http://blogs.princeton.edu/blogit/feed/" rel="self" type="application/rss+xml" />
	<link>http://blogs.princeton.edu/blogit</link>
	<description></description>
	<lastBuildDate>Wed, 01 May 2013 19:43:03 +0000</lastBuildDate>
	<language>en-US</language>
	<sy:updatePeriod>hourly</sy:updatePeriod>
	<sy:updateFrequency>1</sy:updateFrequency>
			<item>
		<title>Claude Shannon Centennial Stamp U. S. Postal Service</title>
		<link>http://blogs.princeton.edu/blogit/2013/05/01/claude-shannon-centennial-stamp-u-s-postal-service/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/05/01/claude-shannon-centennial-stamp-u-s-postal-service/#comments</comments>
		<pubDate>Wed, 01 May 2013 19:43:03 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blogs.princeton.edu/blogit/?p=287</guid>
		<description><![CDATA[Please add your name to the petition list: http://www.itsoc.org/about/shannons-centenary-us-postal-stamp]]></description>
				<content:encoded><![CDATA[<p>Please add your name to the petition list:</p>
<p>http://www.itsoc.org/about/shannons-centenary-us-postal-stamp</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/05/01/claude-shannon-centennial-stamp-u-s-postal-service/feed/</wfw:commentRss>
		<slash:comments>2</slash:comments>
		</item>
		<item>
		<title>An Inequality Involving a Product of Mutual Informations</title>
		<link>http://blogs.princeton.edu/blogit/2013/04/05/an-inequality-involving-a-product-of-mutual-informations/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/04/05/an-inequality-involving-a-product-of-mutual-informations/#comments</comments>
		<pubDate>Fri, 05 Apr 2013 14:59:26 +0000</pubDate>
		<dc:creator>Jingbo Liu</dc:creator>
				<category><![CDATA[Conjectures]]></category>

		<guid isPermaLink="false">http://blogs.princeton.edu/blogit/?p=236</guid>
		<description><![CDATA[Many inequalities in information theory have entropies or mutual informations appearing as linear additive terms (e.g. Shannon-type inequalities). It’s also not uncommon to see entropies appearing as exponents (e.g. entropy power inequality). But perhaps it’s not immediately seen how products of mutual &#8230; <a href="http://blogs.princeton.edu/blogit/2013/04/05/an-inequality-involving-a-product-of-mutual-informations/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Many inequalities in information theory have entropies or mutual informations appearing as linear additive terms (e.g. Shannon-type inequalities). It’s also not uncommon to see entropies appearing as exponents (e.g. entropy power inequality). But perhaps it’s not immediately seen how products of mutual informations make sense.</p>
<p>Recently I have encountered an inequality involving a product of mutual informations, which I cannot find a good way to prove (or disprove, though some numerics and asymptotic analysis seem to suggest its validity). I would much appreciate it if someone could be smart and gracious enough to provide a proof, or counter example, or generalization.</p>
<p>The formulation is quite simple: suppose <img src="//s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> is a binary random variable with  <img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+P%28X%3D0%29%3DP%28X%3D1%29%3D1%2F2&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="&#92;displaystyle P(X=0)=P(X=1)=1/2" title="&#92;displaystyle P(X=0)=P(X=1)=1/2" class="latex" />. The random variable <img src="//s0.wp.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Y" title="Y" class="latex" /> is the output of <img src="//s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> passing through a binary symmetric channel. Moreover <img src="//s0.wp.com/latex.php?latex=Z&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Z" title="Z" class="latex" /> is another random variable such that <img src="//s0.wp.com/latex.php?latex=X-Y-Z&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X-Y-Z" title="X-Y-Z" class="latex" /> forms a Markov chain. The claim is that <img src="//s0.wp.com/latex.php?latex=I%28X%3BZ%29%5Cge+I%28X%3BY%29I%28Y%3BZ%29&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="I(X;Z)&#92;ge I(X;Y)I(Y;Z)" title="I(X;Z)&#92;ge I(X;Y)I(Y;Z)" class="latex" />. (Note that the left side is also upper bounded by either one of the two factors on the right side, by the well-known data processing  inequality.</p>
<p>At first glance this inequality seems absurd because different units appear on the two sides; but this may be resolved by considering that <img src="//s0.wp.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Y" title="Y" class="latex" /> has only one bit of information.</p>
<p>For a given joint distribution of <img src="//s0.wp.com/latex.php?latex=X&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="X" title="X" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Y" title="Y" class="latex" />, the equality can be achieved when <img src="//s0.wp.com/latex.php?latex=Z&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Z" title="Z" class="latex" /> is an injective function of <img src="//s0.wp.com/latex.php?latex=Y&#038;bg=ffffff&#038;fg=000&#038;s=0" alt="Y" title="Y" class="latex" />.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/04/05/an-inequality-involving-a-product-of-mutual-informations/feed/</wfw:commentRss>
		<slash:comments>3</slash:comments>
		</item>
		<item>
		<title>Teaching Lossless Data Compression</title>
		<link>http://blogs.princeton.edu/blogit/2013/04/01/teaching-lossless-data-compression/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/04/01/teaching-lossless-data-compression/#comments</comments>
		<pubDate>Mon, 01 Apr 2013 17:29:58 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blogs.princeton.edu/blogit/?p=232</guid>
		<description><![CDATA[http://www.princeton.edu/~verdu/reprints/VerITNL2011.pdf]]></description>
				<content:encoded><![CDATA[<p>http://www.princeton.edu/~verdu/reprints/VerITNL2011.pdf</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/04/01/teaching-lossless-data-compression/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>An Extremal Conjecture:  Experimenting with Online Collaboration</title>
		<link>http://blogs.princeton.edu/blogit/2013/03/05/an-extremal-conjecture-experimenting-with-online-collaboration/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/03/05/an-extremal-conjecture-experimenting-with-online-collaboration/#comments</comments>
		<pubDate>Tue, 05 Mar 2013 17:57:55 +0000</pubDate>
		<dc:creator>Thomas Courtade</dc:creator>
				<category><![CDATA[Conjectures]]></category>
		<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">http://blogs.princeton.edu/blogit/?p=165</guid>
		<description><![CDATA[I have an extremal conjecture that I have been working on intermittently with some colleagues (including Jiantao Jiao, Tsachy Weissman, Chandra Nair, and Kartik Venkat). Despite our efforts, we have not been able to prove it. Hence, I thought I &#8230; <a href="http://blogs.princeton.edu/blogit/2013/03/05/an-extremal-conjecture-experimenting-with-online-collaboration/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>I have an extremal conjecture that I have been working on intermittently with some colleagues (including Jiantao Jiao, Tsachy Weissman, Chandra Nair, and Kartik Venkat). Despite our efforts, we have not been able to prove it. Hence, I thought I would experiment with online collaboration by offering it to the broader IT community.</p>
<p><span style="color: #000000"><strong>In order to make things interesting, we are offering a $1000 prize for the first correct proof of the conjecture, or a $250 award for the first counterexample!</strong> </span>Feel free to post your thoughts in the public comments. You can also email me if you have questions or want to bounce some ideas off me.</p>
<p>Although I have no way of enforcing them, please abide by the following ground rules:</p>
<ol>
<li>If you decide to work on this conjecture, please send me an email to let me know that you are doing so. As part of this experiment with online collaboration, I want to gauge how many people become involved at various degrees.</li>
<li>If you solve the conjecture or make significant progress, please keep me informed.</li>
<li>If you repost this conjecture, or publish any results, please cite this blog post appropriately.</li>
</ol>
<p>One final disclaimer: this post is meant to be a brief introduction to the conjecture, with a few partial results to get the conversation started; it is not an exhaustive account of the approaches we have tried.</p>
<p style="text-align: center"><b>1. The Conjecture </b></p>
<blockquote><p><b>Conjecture 1. </b> <em><a name="conjextremal"></a> Suppose <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" /> are jointly Gaussian, each with unit variance and correlation <img src="//s0.wp.com/latex.php?latex=%7B%5Crho%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho}" title="{&#92;rho}" class="latex" />. Then, for any <img src="//s0.wp.com/latex.php?latex=%7BU%2CV%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U,V}" title="{U,V}" class="latex" /> satisfying <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y-V%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y-V}" title="{U-X-Y-V}" class="latex" />, the following inequality holds: <a name="main"></a><br />
</em></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2I%28Y%3BU%29%7D+2%5E%7B-2I%28X%3BV%7CU%29%7D+%5Cgeq+%281-%5Crho%5E2%29%2B+%5Crho%5E2+2%5E%7B-2I%28X%3BU%29%7D+2%5E%7B-2I%28Y%3BV%7CU%29%7D+.+%5C+%5C+%5C+%5C+%5C+%281%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2I(Y;U)} 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho^2)+ &#92;rho^2 2^{-2I(X;U)} 2^{-2I(Y;V|U)} . &#92; &#92; &#92; &#92; &#92; (1)" title="&#92;displaystyle 2^{-2I(Y;U)} 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho^2)+ &#92;rho^2 2^{-2I(X;U)} 2^{-2I(Y;V|U)} . &#92; &#92; &#92; &#92; &#92; (1)" class="latex" /></p>
<p><em><a name="main"></a> </em></p></blockquote>
<p style="text-align: center"><b>2. Partial Results </b></p>
<p>There are several partial results which suggest the validity of Conjecture <a href="#conjextremal" class="liinternal">1</a>. Moreover, numerical experiments have not produced a counterexample.</p>
<p>Conjecture <a href="#conjextremal" class="liinternal">1</a> extends the following well-known consequence of the conditional entropy power inequality to include long Markov chains.</p>
<blockquote><p><b>Lemma 1 (Oohama, 1997). </b> <em><a name="lemepi"></a> Suppose <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" /> are jointly Gaussian, each with unit variance and correlation <img src="//s0.wp.com/latex.php?latex=%7B%5Crho%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho}" title="{&#92;rho}" class="latex" />. Then, for any <img src="//s0.wp.com/latex.php?latex=%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U}" title="{U}" class="latex" /> satisfying <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y}" title="{U-X-Y}" class="latex" />, the following inequality holds:<br />
</em></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2+I%28Y%3BU%29%7D+%5Cgeq+1-%5Crho%5E2%2B%5Crho%5E2+2%5E%7B-2I%28X%3BU%29%7D.+%5C+%5C+%5C+%5C+%5C+%282%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2 I(Y;U)} &#92;geq 1-&#92;rho^2+&#92;rho^2 2^{-2I(X;U)}. &#92; &#92; &#92; &#92; &#92; (2)" title="&#92;displaystyle 2^{-2 I(Y;U)} &#92;geq 1-&#92;rho^2+&#92;rho^2 2^{-2I(X;U)}. &#92; &#92; &#92; &#92; &#92; (2)" class="latex" /></p>
</blockquote>
<p><em>Proof:</em> Consider any <img src="//s0.wp.com/latex.php?latex=%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U}" title="{U}" class="latex" /> satisfying <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y}" title="{U-X-Y}" class="latex" />. Let <img src="//s0.wp.com/latex.php?latex=%7BY_u%2C+X_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Y_u, X_u}" title="{Y_u, X_u}" class="latex" /> denote the random variables <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" /> conditioned on <img src="//s0.wp.com/latex.php?latex=%7BU%3Du%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U=u}" title="{U=u}" class="latex" />. By Markovity and definition of <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" />, we have that <img src="//s0.wp.com/latex.php?latex=%7BY_u+%3D+%5Crho+X_u+%2B+Z%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Y_u = &#92;rho X_u + Z}" title="{Y_u = &#92;rho X_u + Z}" class="latex" />, where <img src="//s0.wp.com/latex.php?latex=%7BZ%5Csim+N%280%2C1-%5Crho%5E2%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z&#92;sim N(0,1-&#92;rho^2)}" title="{Z&#92;sim N(0,1-&#92;rho^2)}" class="latex" /> is independent of <img src="//s0.wp.com/latex.php?latex=%7BX_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_u}" title="{X_u}" class="latex" />. Hence, the conditional entropy power inequality implies that</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B2h%28Y%7CU%29%7D+%5Cgeq+%5Crho%5E2+2%5E%7B2h%28X%7CU%29%7D+%2B+2+%5Cpi+e%281-%5Crho%5E2%29+%3D+2+%5Cpi+e+%5Crho%5E2+2%5E%7B-2I%28X%3BU%29%7D+%2B+2+%5Cpi+e%281-%5Crho%5E2%29.+%5C+%5C+%5C+%5C+%5C+%283%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{2h(Y|U)} &#92;geq &#92;rho^2 2^{2h(X|U)} + 2 &#92;pi e(1-&#92;rho^2) = 2 &#92;pi e &#92;rho^2 2^{-2I(X;U)} + 2 &#92;pi e(1-&#92;rho^2). &#92; &#92; &#92; &#92; &#92; (3)" title="&#92;displaystyle 2^{2h(Y|U)} &#92;geq &#92;rho^2 2^{2h(X|U)} + 2 &#92;pi e(1-&#92;rho^2) = 2 &#92;pi e &#92;rho^2 2^{-2I(X;U)} + 2 &#92;pi e(1-&#92;rho^2). &#92; &#92; &#92; &#92; &#92; (3)" class="latex" /></p>
<p>From here, the lemma easily follows. <img src="//s0.wp.com/latex.php?latex=%5CBox&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;Box" title="&#92;Box" class="latex" /></p>
<p>Lemma <a href="#lemepi" class="liinternal">1</a> can be applied to prove the following special case of Conjecture <a href="#conjextremal" class="liinternal">1</a>. This result subsumes most of the special cases I can think of analyzing analytically.</p>
<blockquote><p><b>Proposition 1. </b> <em><a name="thmextremal"></a> Suppose <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" /> are jointly Gaussian, each with unit variance and correlation <img src="//s0.wp.com/latex.php?latex=%7B%5Crho%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho}" title="{&#92;rho}" class="latex" />. If <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y}" title="{U-X-Y}" class="latex" /> are jointly Gaussian and <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y-V%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y-V}" title="{U-X-Y-V}" class="latex" />, then <a name="extremal"></a><br />
</em></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2I%28Y%3BU%29%7D+2%5E%7B-2I%28X%3BV%7CU%29%7D+%5Cgeq+%281-%5Crho%5E2%29%2B+%5Crho%5E2+2%5E%7B-2I%28X%3BU%29%7D+2%5E%7B-2I%28Y%3BV%7CU%29%7D.+%5C+%5C+%5C+%5C+%5C+%284%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2I(Y;U)} 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho^2)+ &#92;rho^2 2^{-2I(X;U)} 2^{-2I(Y;V|U)}. &#92; &#92; &#92; &#92; &#92; (4)" title="&#92;displaystyle 2^{-2I(Y;U)} 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho^2)+ &#92;rho^2 2^{-2I(X;U)} 2^{-2I(Y;V|U)}. &#92; &#92; &#92; &#92; &#92; (4)" class="latex" /></p>
</blockquote>
<p><em>Proof:</em> Without loss of generality, we can assume that <img src="//s0.wp.com/latex.php?latex=%7BU%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U}" title="{U}" class="latex" /> has zero mean and unit variance. Define <img src="//s0.wp.com/latex.php?latex=%7B%5Crho_u+%3D+E%5BXU%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho_u = E[XU]}" title="{&#92;rho_u = E[XU]}" class="latex" />. Since <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y}" title="{U-X-Y}" class="latex" /> are jointly Gaussian, we have<br />
<a name="Ixu"></a></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+I%28X%3BU%29+%3D%5Cfrac%7B1%7D%7B2%7D%5Clog%5Cfrac%7B1%7D%7B1-%5Crho_u%5E2%7D+%5C+%5C+%5C+%5C+%5C+%285%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle I(X;U) =&#92;frac{1}{2}&#92;log&#92;frac{1}{1-&#92;rho_u^2} &#92; &#92; &#92; &#92; &#92; (5)" title="&#92;displaystyle I(X;U) =&#92;frac{1}{2}&#92;log&#92;frac{1}{1-&#92;rho_u^2} &#92; &#92; &#92; &#92; &#92; (5)" class="latex" /></p>
<p><a name="Iyu"></a></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+I%28Y%3BU%29+%3D%5Cfrac%7B1%7D%7B2%7D%5Clog%5Cfrac%7B1%7D%7B1-%5Crho%5E2%5Crho_u%5E2%7D.+%5C+%5C+%5C+%5C+%5C+%286%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle I(Y;U) =&#92;frac{1}{2}&#92;log&#92;frac{1}{1-&#92;rho^2&#92;rho_u^2}. &#92; &#92; &#92; &#92; &#92; (6)" title="&#92;displaystyle I(Y;U) =&#92;frac{1}{2}&#92;log&#92;frac{1}{1-&#92;rho^2&#92;rho_u^2}. &#92; &#92; &#92; &#92; &#92; (6)" class="latex" /></p>
<p>Let <img src="//s0.wp.com/latex.php?latex=%7BX_u%2CY_u%2CV_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_u,Y_u,V_u}" title="{X_u,Y_u,V_u}" class="latex" /> denote the random variables <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%2CV%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y,V}" title="{X,Y,V}" class="latex" /> conditioned on <img src="//s0.wp.com/latex.php?latex=%7BU%3Du%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U=u}" title="{U=u}" class="latex" />, respectively. Define <img src="//s0.wp.com/latex.php?latex=%7B%5Crho_%7BXY%7Cu%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho_{XY|u}}" title="{&#92;rho_{XY|u}}" class="latex" /> to be the correlation coefficient between <img src="//s0.wp.com/latex.php?latex=%7BX_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_u}" title="{X_u}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BY_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Y_u}" title="{Y_u}" class="latex" />. It is readily verified that <a name="pxyu"></a></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Crho_%7BXY%7Cu%7D+%3D+%5Cfrac%7B%5Crho%5Csqrt%7B1-%5Crho_u%5E2%7D%7D%7B%5Csqrt%7B1-%5Crho%5E2%5Crho_u%5E2%7D%7D%2C+%5C+%5C+%5C+%5C+%5C+%287%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle &#92;rho_{XY|u} = &#92;frac{&#92;rho&#92;sqrt{1-&#92;rho_u^2}}{&#92;sqrt{1-&#92;rho^2&#92;rho_u^2}}, &#92; &#92; &#92; &#92; &#92; (7)" title="&#92;displaystyle &#92;rho_{XY|u} = &#92;frac{&#92;rho&#92;sqrt{1-&#92;rho_u^2}}{&#92;sqrt{1-&#92;rho^2&#92;rho_u^2}}, &#92; &#92; &#92; &#92; &#92; (7)" class="latex" /></p>
<p>which does not depend on the particular value of <img src="//s0.wp.com/latex.php?latex=%7Bu%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{u}" title="{u}" class="latex" />. By plugging <a href="#Ixu" class="liinternal">(5)</a>-<a href="#pxyu" class="liinternal">(7)</a> into <a href="#extremal" class="liinternal">(4)</a>, we see that <a href="#extremal" class="liinternal">(4)</a> is equivalent to <a name="desired"></a></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2I%28X%3BV%7CU%29%7D+%5Cgeq+%281-%5Crho_%7BXY%7Cu%7D%5E2%29%2B+%5Crho_%7BXY%7Cu%7D%5E2+2%5E%7B-2I%28Y%3BV%7CU%29%7D.+%5C+%5C+%5C+%5C+%5C+%288%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2I(Y;V|U)}. &#92; &#92; &#92; &#92; &#92; (8)" title="&#92;displaystyle 2^{-2I(X;V|U)} &#92;geq (1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2I(Y;V|U)}. &#92; &#92; &#92; &#92; &#92; (8)" class="latex" /></p>
<p>For every <img src="//s0.wp.com/latex.php?latex=%7Bu%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{u}" title="{u}" class="latex" />, the variables <img src="//s0.wp.com/latex.php?latex=%7BX_u%2CY_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_u,Y_u}" title="{X_u,Y_u}" class="latex" /> are jointly Gaussian with correlation coefficient <img src="//s0.wp.com/latex.php?latex=%7B%5Crho_%7BXY%7Cu%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho_{XY|u}}" title="{&#92;rho_{XY|u}}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BX_u-Y_u-V_u%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_u-Y_u-V_u}" title="{X_u-Y_u-V_u}" class="latex" /> form a Markov chain, hence Lemma <a href="#lemepi" class="liinternal">1</a> implies</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2I%28X_u%3BV_u%29%7D+%5Cgeq+%281-%5Crho_%7BXY%7Cu%7D%5E2%29%2B+%5Crho_%7BXY%7Cu%7D%5E2+2%5E%7B-2I%28Y_u%3BV_u%29%7D.+%5C+%5C+%5C+%5C+%5C+%289%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2I(X_u;V_u)} &#92;geq (1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2I(Y_u;V_u)}. &#92; &#92; &#92; &#92; &#92; (9)" title="&#92;displaystyle 2^{-2I(X_u;V_u)} &#92;geq (1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2I(Y_u;V_u)}. &#92; &#92; &#92; &#92; &#92; (9)" class="latex" /></p>
<p>The desired inequality <a href="#desired" class="liinternal">(8)</a> follows by convexity of</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Clog%5Cleft%5B%281-%5Crho_%7BXY%7Cu%7D%5E2%29%2B+%5Crho_%7BXY%7Cu%7D%5E2+2%5E%7B-2z%7D%5Cright%5D+%5C+%5C+%5C+%5C+%5C+%2810%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle &#92;log&#92;left[(1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2z}&#92;right] &#92; &#92; &#92; &#92; &#92; (10)" title="&#92;displaystyle &#92;log&#92;left[(1-&#92;rho_{XY|u}^2)+ &#92;rho_{XY|u}^2 2^{-2z}&#92;right] &#92; &#92; &#92; &#92; &#92; (10)" class="latex" /></p>
<p>as a function of <img src="//s0.wp.com/latex.php?latex=%7Bz%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{z}" title="{z}" class="latex" />. <img src="//s0.wp.com/latex.php?latex=%5CBox&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;Box" title="&#92;Box" class="latex" /></p>
<p style="text-align: center"><b>3. Equivalent Forms </b></p>
<p>There are many equivalent forms of Conjecture <a href="#conjextremal" class="liinternal">1</a>. For example, <a href="#main" class="liinternal">(1)</a> can be replaced by the symmetric inequality</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+2%5E%7B-2%28I%28X%3BV%29%2BI%28Y%3BU%29%29%7D+%5Cgeq+%281-%5Crho%5E2%292%5E%7B-2I%28U%3BV%29%7D%2B+%5Crho%5E2+2%5E%7B-2%28I%28X%3BU%29%2BI%28Y%3BV%29%29%7D.+%5C+%5C+%5C+%5C+%5C+%2811%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle 2^{-2(I(X;V)+I(Y;U))} &#92;geq (1-&#92;rho^2)2^{-2I(U;V)}+ &#92;rho^2 2^{-2(I(X;U)+I(Y;V))}. &#92; &#92; &#92; &#92; &#92; (11)" title="&#92;displaystyle 2^{-2(I(X;V)+I(Y;U))} &#92;geq (1-&#92;rho^2)2^{-2I(U;V)}+ &#92;rho^2 2^{-2(I(X;U)+I(Y;V))}. &#92; &#92; &#92; &#92; &#92; (11)" class="latex" /></p>
<p>Alternatively, we can consider dual forms of Conjecture <a href="#conjextremal" class="liinternal">1</a>. For instance, one such form is stated as follows:</p>
<blockquote><p><b>Conjecture 1′. </b> <em> Suppose <img src="//s0.wp.com/latex.php?latex=%7BX%2CY%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X,Y}" title="{X,Y}" class="latex" /> are jointly Gaussian, each with unit variance and correlation <img src="//s0.wp.com/latex.php?latex=%7B%5Crho%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;rho}" title="{&#92;rho}" class="latex" />. For <img src="//s0.wp.com/latex.php?latex=%7B%5Clambda%5Cin+%5B%7B1%7D%2F%28%7B1%2B%5Crho%5E2%7D%29%2C1%5D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;lambda&#92;in [{1}/({1+&#92;rho^2}),1]}" title="{&#92;lambda&#92;in [{1}/({1+&#92;rho^2}),1]}" class="latex" />, the infimum of<br />
</em></p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+I%28X%2CY%3BU%2CV%29-%5Clambda%5CBig%28I%28X%3BUV%29%2BI%28Y%3BUV%29%5CBig%29%2C+%5C+%5C+%5C+%5C+%5C+%2812%29&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle I(X,Y;U,V)-&#92;lambda&#92;Big(I(X;UV)+I(Y;UV)&#92;Big), &#92; &#92; &#92; &#92; &#92; (12)" title="&#92;displaystyle I(X,Y;U,V)-&#92;lambda&#92;Big(I(X;UV)+I(Y;UV)&#92;Big), &#92; &#92; &#92; &#92; &#92; (12)" class="latex" /></p>
<p>taken over all <img src="//s0.wp.com/latex.php?latex=%7BU%2CV%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U,V}" title="{U,V}" class="latex" /> satisfying <img src="//s0.wp.com/latex.php?latex=%7BU-X-Y-V%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U-X-Y-V}" title="{U-X-Y-V}" class="latex" /> is attained when <img src="//s0.wp.com/latex.php?latex=%7BU%2CX%2CY%2CV%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{U,X,Y,V}" title="{U,X,Y,V}" class="latex" /> are jointly Gaussian.</p></blockquote>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/03/05/an-extremal-conjecture-experimenting-with-online-collaboration/feed/</wfw:commentRss>
		<slash:comments>4</slash:comments>
		</item>
		<item>
		<title>President Obama bestows 2011 National Medals of Science and Technology</title>
		<link>http://blogs.princeton.edu/blogit/2013/02/01/president-obama-bestows-national-medals-of-science-and-technology/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/02/01/president-obama-bestows-national-medals-of-science-and-technology/#comments</comments>
		<pubDate>Sat, 02 Feb 2013 00:51:38 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=159</guid>
		<description><![CDATA[including Claude E. Shannon Award winner Sol Golomb http://www.whitehouse.gov/photos-and-video/video/2013/02/01/president-obama-honors-countrys-top-innovators-and-scientists-2011]]></description>
				<content:encoded><![CDATA[<p>including Claude E. Shannon Award winner Sol Golomb</p>
<p>http://www.whitehouse.gov/photos-and-video/video/2013/02/01/president-obama-honors-countrys-top-innovators-and-scientists-2011</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/02/01/president-obama-bestows-national-medals-of-science-and-technology/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Nevanlinna Prize</title>
		<link>http://blogs.princeton.edu/blogit/2013/01/31/nevanlinna-prize/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/01/31/nevanlinna-prize/#comments</comments>
		<pubDate>Thu, 31 Jan 2013 20:24:21 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=156</guid>
		<description><![CDATA[Nominations of people born on or after January 1, 1974 for outstanding contributions in Mathematical Aspects of Information Sciences including: All mathematical aspects of computer science, including complexity theory, logic of programming languages, analysis of algorithms, cryptography, computer vision, pattern &#8230; <a href="http://blogs.princeton.edu/blogit/2013/01/31/nevanlinna-prize/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Nominations of people born on or after January 1, 1974</p>
<p>for outstanding contributions in Mathematical Aspects of Information Sciences including:</p>
<ol>
<li>All mathematical aspects of computer science, including complexity theory, logic of programming languages, analysis of algorithms, cryptography, computer vision, pattern recognition, information processing and modelling of intelligence.</li>
<li>Scientific computing and numerical analysis. Computational aspects of optimization and control theory. Computer algebra.</li>
</ol>
<p>Nomination Procedure: http://www.mathunion.org/general/prizes/nevanlinna/details/</p>
<p> </p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/01/31/nevanlinna-prize/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The Epijournal: a new publication model</title>
		<link>http://blogs.princeton.edu/blogit/2013/01/23/the-epijournal-a-new-publication-model/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/01/23/the-epijournal-a-new-publication-model/#comments</comments>
		<pubDate>Wed, 23 Jan 2013 18:27:59 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=154</guid>
		<description><![CDATA[http://www.nature.com/news/mathematicians-aim-to-take-publishers-out-of-publishing-1.12243]]></description>
				<content:encoded><![CDATA[<p>http://www.nature.com/news/mathematicians-aim-to-take-publishers-out-of-publishing-1.12243</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/01/23/the-epijournal-a-new-publication-model/feed/</wfw:commentRss>
		<slash:comments>1</slash:comments>
		</item>
		<item>
		<title>Information and Inference (new journal)</title>
		<link>http://blogs.princeton.edu/blogit/2013/01/03/information-and-inference-new-journal/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/01/03/information-and-inference-new-journal/#comments</comments>
		<pubDate>Thu, 03 Jan 2013 18:01:02 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=149</guid>
		<description><![CDATA[The first issue of Information and Inference has just appeared: http://imaiai.oxfordjournals.org/content/current It includes the following editorial: In recent years, a great deal of energy and talent have been devoted to new research problems arising from our era of abundant and &#8230; <a href="http://blogs.princeton.edu/blogit/2013/01/03/information-and-inference-new-journal/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>The first issue of Information and Inference has just appeared:</p>
<p><a href="http://imaiai.oxfordjournals.org/content/current" class="liexternal">http://imaiai.oxfordjournals.org/content/current</a></p>
<p>It includes the following editorial:</p>
<div>
<p>In recent years, a great deal of energy and talent have been devoted to new research problems arising from our era of abundant and varied data/information. These efforts have combined advanced methods drawn from across the spectrum of established academic disciplines: discrete and applied mathematics, computer science, theoretical statistics, physics, engineering, biology and even finance. This new journal is designed to serve as a meeting place for ideas connecting the theory and application of information and inference from across these disciplines.</p>
<p>While the frontiers of research involving information and inference are dynamic, we are currently planning to publish in information theory, statistical inference, network analysis, numerical analysis, learning theory, applied and computational harmonic analysis, probability, combinatorics, signal and image processing, and high-dimensional geometry; we also encourage papers not fitting the above description, but which expose novel problems, innovative data types, surprising connections between disciplines and alternative approaches to inference. This first issue exemplifies this topical diversity of the subject matter, linked by the use of sophisticated mathematical modelling, techniques of analysis, and focus on timely applications.</p>
<p>To enhance the impact of each manuscript, authors are encouraged to provide software to illus– trate their algorithm and where possible replicate the experiments presented in their manuscripts. Manuscripts with accompanying software are marked as “reproducible” and have the software linked on the journal website under supplementary material. It is with pleasure that we welcome the scien– tific community to this new publication venue.</p>
<p>Robert Calderbank David L. Donoho John Shawe-Taylor Jared Tanner</p>
</div>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/01/03/information-and-inference-new-journal/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>Comparing Variability of Random Variables</title>
		<link>http://blogs.princeton.edu/blogit/2013/01/02/comparing-variability-of-random-variables/</link>
		<comments>http://blogs.princeton.edu/blogit/2013/01/02/comparing-variability-of-random-variables/#comments</comments>
		<pubDate>Thu, 03 Jan 2013 01:47:18 +0000</pubDate>
		<dc:creator>Tara Javidi</dc:creator>
				<category><![CDATA[Uncategorized]]></category>
		<category><![CDATA[Useful Facts]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=133</guid>
		<description><![CDATA[Consider exchangeable random variables . A couple of facts seem quite intuitive: Statement 1. The “variability” of sample mean decreases with . Statement 2. Let the average of functions be defined as . Then is less “variable” than .   &#8230; <a href="http://blogs.princeton.edu/blogit/2013/01/02/comparing-variability-of-random-variables/">Continue reading <span class="meta-nav">&#8594;</span></a>]]></description>
				<content:encoded><![CDATA[<p>Consider exchangeable random variables <img src="//s0.wp.com/latex.php?latex=%7BX_1%2C+%5Cldots%2C+X_n%2C+%5Cldots%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{X_1, &#92;ldots, X_n, &#92;ldots}" title="{X_1, &#92;ldots, X_n, &#92;ldots}" class="latex" />. A couple of facts seem quite intuitive:</p>
<p><strong>Statement 1.</strong> The “variability” of sample mean <img src="//s0.wp.com/latex.php?latex=%7BS_m+%3D+%5Cfrac%7B1%7D%7Bm%7D+%5Csum_%7Bi%3D1%7D%5E%7Bm%7D+X_i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{S_m = &#92;frac{1}{m} &#92;sum_{i=1}^{m} X_i}" title="{S_m = &#92;frac{1}{m} &#92;sum_{i=1}^{m} X_i}" class="latex" /> decreases with <img src="//s0.wp.com/latex.php?latex=%7Bm%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{m}" title="{m}" class="latex" />.</p>
<p><strong>Statement 2.</strong> Let the average of functions <img src="//s0.wp.com/latex.php?latex=%7Bf_1%2C+f_2%2C+%5Cldots%2C+f_n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{f_1, f_2, &#92;ldots, f_n}" title="{f_1, f_2, &#92;ldots, f_n}" class="latex" /> be defined as <img src="//s0.wp.com/latex.php?latex=%7B%5Coverline%7Bf%7D+%28x%29+%3A%3D+%5Cfrac%7B1%7D%7Bn%7D+%5Csum_%7Bi%3D1%7D%5E%7Bn%7D+f_i%28x%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;overline{f} (x) := &#92;frac{1}{n} &#92;sum_{i=1}^{n} f_i(x)}" title="{&#92;overline{f} (x) := &#92;frac{1}{n} &#92;sum_{i=1}^{n} f_i(x)}" class="latex" />. Then <img src="//s0.wp.com/latex.php?latex=%7B%5Cmax_%7B1%5Cleq+i+%5Cleq+n%7D+%5Coverline%7Bf%7D%28X_i%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;max_{1&#92;leq i &#92;leq n} &#92;overline{f}(X_i)}" title="{&#92;max_{1&#92;leq i &#92;leq n} &#92;overline{f}(X_i)}" class="latex" /> is less “variable” than <img src="//s0.wp.com/latex.php?latex=%7B%5Cmax_%7B1%5Cleq+i+%5Cleq+n%7D+f_i+%28X_i%29%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;max_{1&#92;leq i &#92;leq n} f_i (X_i)}" title="{&#92;max_{1&#92;leq i &#92;leq n} f_i (X_i)}" class="latex" />.</p>
<p> </p>
<p>To make these statements precise, one faces the fundamental question of comparing two random variables <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" /> (or more precisely comparing two distributions). One common way we think of ordering random variables is the notion of stochastic dominance:</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+W+%5Cleq_%7Bst%7D+Z+%5CLeftrightarrow+F_W%28t%29+%5Cgeq+F_Z%28t%29+%5C+%5C+%5C+%5Cmbox%7B+for+all+real+%7D+t.+&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle W &#92;leq_{st} Z &#92;Leftrightarrow F_W(t) &#92;geq F_Z(t) &#92; &#92; &#92; &#92;mbox{ for all real } t. " title="&#92;displaystyle W &#92;leq_{st} Z &#92;Leftrightarrow F_W(t) &#92;geq F_Z(t) &#92; &#92; &#92; &#92;mbox{ for all real } t. " class="latex" /></p>
<p>However, this notion really is only a suitable notion when one is concerned with the actual size of the random quantities of interest, while, in our scenario of interest, a more natural order would be that which compares the variability between two random variables (or more precisely, again, the two distributions). It turns out that a very useful notion, used in a variety of fields, is due to Ross (1983): Random variable <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> is said to be <em>stochastically less variable</em> than random variable <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" /> (denoted by <img src="//s0.wp.com/latex.php?latex=%7B%5Cleq_v%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;leq_v}" title="{&#92;leq_v}" class="latex" />) when every risk-averse decision maker will choose <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> over <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" /> (given they have similar means). More precisely, for random variables <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" /> with finite means</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+W+%5Cleq_%7Bv%7D+Z+%5CLeftrightarrow+%5Cmathbb%7BE%7D%5Bf%28X%29%5D+%5Cleq+%5Cmathbb%7BE%7D%5Bf%28Y%29%5D+%5C+%5C+%5Cmbox%7B+for+increasing+and+convex+function+%7D+f+%5Cin+%5Cmathcal%7BF%7D+&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle W &#92;leq_{v} Z &#92;Leftrightarrow &#92;mathbb{E}[f(X)] &#92;leq &#92;mathbb{E}[f(Y)] &#92; &#92; &#92;mbox{ for increasing and convex function } f &#92;in &#92;mathcal{F} " title="&#92;displaystyle W &#92;leq_{v} Z &#92;Leftrightarrow &#92;mathbb{E}[f(X)] &#92;leq &#92;mathbb{E}[f(Y)] &#92; &#92; &#92;mbox{ for increasing and convex function } f &#92;in &#92;mathcal{F} " class="latex" /></p>
<p>where <img src="//s0.wp.com/latex.php?latex=%7B%5Cmathcal%7BF%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;mathcal{F}}" title="{&#92;mathcal{F}}" class="latex" /> is the set of functions for which the above expectations exist.</p>
<p>One interesting, but perhaps not entirely obvious, fact is that this notion of ordering <img src="//s0.wp.com/latex.php?latex=%7BW%5Cleq_v+Z%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W&#92;leq_v Z}" title="{W&#92;leq_v Z}" class="latex" /> is equivalent to saying that there is a sequence of mean preserving spreads that in the limit transforms the distribution of <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> into the distribution of another random variable <img src="//s0.wp.com/latex.php?latex=%7BW%27%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W&#039;}" title="{W&#039;}" class="latex" /> with finite mean such that <img src="//s0.wp.com/latex.php?latex=%7BW%27%5Cleq_%7Bst%7D+Z%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W&#039;&#92;leq_{st} Z}" title="{W&#039;&#92;leq_{st} Z}" class="latex" />! Also, using results by Hardy, Littlewood and Polya (1929), the stochastic variability order introduced above can be shown to be equivalent to Lorenz (1905) ordering used in economics to measure income equality.</p>
<p>Now with this, we are ready to formalize our previous statements. The first statement is actually due to Arnold and Villasenor (1986):</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cfrac%7B1%7D%7Bm%7D+%5Csum_%7Bi%3D1%7D%5E%7Bm%7D+X_i+%5Cleq_v+%5Cfrac%7B1%7D%7Bm-1%7D+%5Csum_%7Bi%3D1%7D%5E%7Bm-1%7D+X_i+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5C+%5Cmbox%7Bfor+all+%7D%5C+%5C+m+%5Cin+%5Cmathbb%7BN%7D.+&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle &#92;frac{1}{m} &#92;sum_{i=1}^{m} X_i &#92;leq_v &#92;frac{1}{m-1} &#92;sum_{i=1}^{m-1} X_i &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92;mbox{for all }&#92; &#92; m &#92;in &#92;mathbb{N}. " title="&#92;displaystyle &#92;frac{1}{m} &#92;sum_{i=1}^{m} X_i &#92;leq_v &#92;frac{1}{m-1} &#92;sum_{i=1}^{m-1} X_i &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92; &#92;mbox{for all }&#92; &#92; m &#92;in &#92;mathbb{N}. " class="latex" /></p>
<p>Note that when you apply this fact to a sequence of iid random variables with finite mean <img src="//s0.wp.com/latex.php?latex=%7B%5Cmu%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;mu}" title="{&#92;mu}" class="latex" />, it strengthens the strong law of large number in that it ensures that the almost sure convergence of the sample mean to the mean value <img src="//s0.wp.com/latex.php?latex=%7B%5Cmu%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;mu}" title="{&#92;mu}" class="latex" /> occurs with monotonically decreasing variability (as the sample size grows).</p>
<p>The second statement comes up in proving certain optimality result in sharing parallel servers in fork-join queueing systems (J. 2008) and has a similar flavor:</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cmax_%7B1%5Cleq+i+%5Cleq+n%7D+%5Coverline%7Bf%7D%28X_i%29+%5Cleq_v+%5Cmax_%7B1%5Cleq+i+%5Cleq+n%7D+f_i+%28X_i%29.+&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle &#92;max_{1&#92;leq i &#92;leq n} &#92;overline{f}(X_i) &#92;leq_v &#92;max_{1&#92;leq i &#92;leq n} f_i (X_i). " title="&#92;displaystyle &#92;max_{1&#92;leq i &#92;leq n} &#92;overline{f}(X_i) &#92;leq_v &#92;max_{1&#92;leq i &#92;leq n} f_i (X_i). " class="latex" /></p>
<p>The cleanest way to prove both statements, to the best of my knowledge, is based on the following theorem first proved by Blackwell in 1953 (later strengthened to random elements in separable Banach spaces by Strassen in 1965, hence referred to by some as Strassen’s theorem):</p>
<blockquote><p><strong>Theorem 1</strong> <em> Let <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" /> be two random variables with finite means. A necessary and sufficient condition for <img src="//s0.wp.com/latex.php?latex=%7BW+%5Cleq_v+Z%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W &#92;leq_v Z}" title="{W &#92;leq_v Z}" class="latex" /> is that there are two random variables <img src="//s0.wp.com/latex.php?latex=%7B%5Chat%7BW%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;hat{W}}" title="{&#92;hat{W}}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7B%5Chat%7BZ%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;hat{Z}}" title="{&#92;hat{Z}}" class="latex" /> with the same marginals as <img src="//s0.wp.com/latex.php?latex=%7BW%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{W}" title="{W}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" />, respectively, such that <img src="//s0.wp.com/latex.php?latex=%7B%5Cmathbb%7BE%7D%5B%5Chat%7BZ%7D+%7C%5Chat%7BW%7D%5D+%5Cgeq+%5Chat%7BW%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;mathbb{E}[&#92;hat{Z} |&#92;hat{W}] &#92;geq &#92;hat{W}}" title="{&#92;mathbb{E}[&#92;hat{Z} |&#92;hat{W}] &#92;geq &#92;hat{W}}" class="latex" /> almost surely. </em></p></blockquote>
<p>For instance, to prove the first statement we consider <img src="//s0.wp.com/latex.php?latex=%7B%5Chat%7BW%7D+%3D+W+%3D+%5Cfrac%7B1%7D%7Bn%7D+%5Csum_%7Bi%3D1%7D%5En+X_i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;hat{W} = W = &#92;frac{1}{n} &#92;sum_{i=1}^n X_i}" title="{&#92;hat{W} = W = &#92;frac{1}{n} &#92;sum_{i=1}^n X_i}" class="latex" /> and <img src="//s0.wp.com/latex.php?latex=%7BZ+%3D+%5Cfrac%7B1%7D%7Bn-1%7D+%5Csum_%7Bi%3D1%7D%5E%7Bn-1%7D+X_i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z = &#92;frac{1}{n-1} &#92;sum_{i=1}^{n-1} X_i}" title="{Z = &#92;frac{1}{n-1} &#92;sum_{i=1}^{n-1} X_i}" class="latex" />. All that is necessary now is to note that <img src="//s0.wp.com/latex.php?latex=%7B%5Chat%7BZ%7D+%3A+%3D+%5Cfrac%7B1%7D%7Bn-1%7D+%5Csum_%7Bi%5Cin+I%2C+i+%5Cneq+J%7D+X_i%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;hat{Z} : = &#92;frac{1}{n-1} &#92;sum_{i&#92;in I, i &#92;neq J} X_i}" title="{&#92;hat{Z} : = &#92;frac{1}{n-1} &#92;sum_{i&#92;in I, i &#92;neq J} X_i}" class="latex" />, <img src="//s0.wp.com/latex.php?latex=%7BJ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{J}" title="{J}" class="latex" /> is an independent uniform rv on the set <img src="//s0.wp.com/latex.php?latex=%7BI+%3A%3D+%5C%7B1%2C2%2C+%5Cldots%2C+n%5C%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{I := &#92;{1,2, &#92;ldots, n&#92;}}" title="{I := &#92;{1,2, &#92;ldots, n&#92;}}" class="latex" />, has the same distribution as random variable <img src="//s0.wp.com/latex.php?latex=%7BZ%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{Z}" title="{Z}" class="latex" />. Furthermore,</p>
<p align="center"><img src="//s0.wp.com/latex.php?latex=%5Cdisplaystyle+%5Cmathbb%7BE%7D+%5B+%5Chat%7BZ%7D+%7C+W+%5D+%3D+%5Cmathbb%7BE%7D+%5B+%5Cfrac%7B1%7D%7Bn%7D+%5Csum_%7BJ%3D1%7D%5E%7Bn%7D+%28%5Cfrac%7B1%7D%7Bn-1%7D+%5Csum_%7Bi%5Cin+I%2C+i+%5Cneq+J%7D+X_i+%29+%7C+W+%5D+%3D+%5Cmathbb%7BE%7D+%5B+%5Cfrac%7B1%7D%7Bn%7D+%5Csum_%7Bj%3D1%7D%5E%7Bn%7D+X_j+%7C+W+%5D+%3D+W.+&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="&#92;displaystyle &#92;mathbb{E} [ &#92;hat{Z} | W ] = &#92;mathbb{E} [ &#92;frac{1}{n} &#92;sum_{J=1}^{n} (&#92;frac{1}{n-1} &#92;sum_{i&#92;in I, i &#92;neq J} X_i ) | W ] = &#92;mathbb{E} [ &#92;frac{1}{n} &#92;sum_{j=1}^{n} X_j | W ] = W. " title="&#92;displaystyle &#92;mathbb{E} [ &#92;hat{Z} | W ] = &#92;mathbb{E} [ &#92;frac{1}{n} &#92;sum_{J=1}^{n} (&#92;frac{1}{n-1} &#92;sum_{i&#92;in I, i &#92;neq J} X_i ) | W ] = &#92;mathbb{E} [ &#92;frac{1}{n} &#92;sum_{j=1}^{n} X_j | W ] = W. " class="latex" /></p>
<p>Similarly to prove the second statement, one can construct <img src="//s0.wp.com/latex.php?latex=%7B%5Chat%7BZ%7D%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{&#92;hat{Z}}" title="{&#92;hat{Z}}" class="latex" /> by selecting a random permutation of functions <img src="//s0.wp.com/latex.php?latex=%7Bf_1%2C+%5Cldots%2C+f_n%7D&#038;bg=ffffff&#038;fg=000000&#038;s=0" alt="{f_1, &#92;ldots, f_n}" title="{f_1, &#92;ldots, f_n}" class="latex" />.</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2013/01/02/comparing-variability-of-random-variables/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
		<item>
		<title>The face of randomness or Poisson for civilians</title>
		<link>http://blogs.princeton.edu/blogit/2012/12/23/the-face-of-randomness-or-poisson-for-civilians/</link>
		<comments>http://blogs.princeton.edu/blogit/2012/12/23/the-face-of-randomness-or-poisson-for-civilians/#comments</comments>
		<pubDate>Sun, 23 Dec 2012 18:49:09 +0000</pubDate>
		<dc:creator>Sergio Verdu</dc:creator>
				<category><![CDATA[Uncategorized]]></category>

		<guid isPermaLink="false">https://blogs.princeton.edu/blogit/?p=128</guid>
		<description><![CDATA[http://www.empiricalzeal.com/2012/12/21/what-does-randomness-look-like/]]></description>
				<content:encoded><![CDATA[<p>http://www.empiricalzeal.com/2012/12/21/what-does-randomness-look-like/</p>
]]></content:encoded>
			<wfw:commentRss>http://blogs.princeton.edu/blogit/2012/12/23/the-face-of-randomness-or-poisson-for-civilians/feed/</wfw:commentRss>
		<slash:comments>0</slash:comments>
		</item>
	</channel>
</rss>
