**T**he controversy about the data in Piketty’s *Capital in the 21st Century* fed a dozen articles in the NYT last week, not to mention other newspapers… To the point of Bloomberg producing this parody of a glitzy popular press cover. I have not read the book, nor intend to read it, but I find the buzz and controversies rather surprising. (Thomas Piketty will actually give a public BBC interview in Paris-Dauphine and in English two weeks from now, on June 19.) Indeed, the book sounds to me like a data analysis of extensive (and elaborately processed) datasets rather than the construct of a new economic theory. And I could not see how the criticisms from the Financial Time were anywhere beyond the obvious ideological. I was thus pleased to read Francis Diebold’s blog on this craze. And to see the repeated acknowledgements that Piketty’s data being made public was a great thing. Although somewhat dismayed that such a painstakenly gathered dataset was not submitted to a more advanced econometric/statistical processing than the one offered by Excel… Definitely looking forward a more [statistically] involved analysis of the data. (Unsurprisingly, Nassim Taleb reanalysed the whole craze as missing the fat tail features of wealth accumulation. With new aphorisms like “the fastest road to bankruptcy in foreign exchange was an economics degree”.)

## Archive for Thomas Piketty

## on the Piketty craze [not a book review]

Posted in Books, University life with tags BBC, Capital, Francis Diebold, Nassim Taleb, Thomas Piketty, Université Paris Dauphine, wealth on June 15, 2014 by xi'an## bias in estimating bracketed quantile contributions

Posted in Books, R, Statistics, University life with tags BXing, heavy-tail distribution, Nassim Taleb, Pareto distribution, The Bed of Procrustes, Thomas Piketty, Vilfredo Pareto on May 16, 2014 by xi'an

“Vilfredo Pareto noticed that 80% of the land in Italy belonged to 20% of the population, and vice-versa, thus both giving birth to the power law class of distributions and the popular saying 80/20.”

**Y**esterday, in “one of those” coincidences, I voluntarily dropped Nassim Taleb’s *The Bed of Procrustes* in a suburban café as my latest contribution to the book-crossing (or bXing!) concept and spotted a newly arXived paper by Taleb and Douadi. Paper which full title is *“On the Biases and Variability in the Estimation of Concentration Using Bracketed Quantile Contributions”* and which central idea is that estimating

(where q_{α} is the α-level quantile of X) by the ratio

can be strongly biased. And that the fatter the tail (i.e. the lower the power β for a power law tail), the worse the bias. This is definitely correct, if not entirely surprising given that the estimating ratio involves a ratio of estimators, plus an estimator of q_{α}. And that both numerator and denominator have finite variances when the power β is less than 2. The paper contains a simulation experiment easily reproduced by the following R code

#biased estimator of kappa(.01) alpha=.01 #tail omalpha=1-alpha T=10^4 #simulations n=10^3 #sample size beta=1.1 #Pareto parameter moobeta=-1/beta kap=rep(0,T) for (t in 1:T){ sampl=runif(n)^moobeta quanta=quantile(sampl,omalpha) kap[t]=sum(sampl[sampl>quanta])/sum(sampl) }

**W**hat is somewhat surprising though is that the paper deems it necessary to run T=10¹² simulations to assess the bias when this bias is already visible in the first digit of κ_{α}. Given that the simulation experiment goes as high as n=10⁸, this means the authors simulated 10²⁰ Pareto variables to exhibit a bias a few thousand replicas could have produced. Checking the numerators and denominators in the above collection of ratios also shows that they may take unbelievably large values.)

“…some theories are built based on claims of such `increase’ in inequality, as in Piketti (2014), without taking into account the true nature of κ, and promulgating theories about the `variation’ of inequality without reference to the stochasticity of theestimation—and the lack of consistency ofκacross time and sub-units.”

**T**he more relevant questions about this issue of estimating κ_{α} are, in my opinion, (a) why this quantity is of enough practical importance to consider its estimation and to seek estimators that would remain robust as the power β varies arbitrarily close to 1; (b) in which sense there is anything more to the phenomenon than the difficulty in estimating β itself; and (c) what is the efficient asymptotic variance for estimating κ_{α} (since there is no particular reason to only consider the most natural estimator). Despite the above quote, that the paper constitutes a major refutation of Piketty’s *Capital in the Twenty-First Century* is rather unlikely!