社科网首页|客户端|官方微博|报刊投稿|邮箱 中国社会科学网
  home   >  Resources

Big data: new tricks for econometrics


Last week my research group discussed Hal Varian’s interesting new paper on “Big data: new tricks for econometrics”, Journal of Economic Perspectives, 28(2): 3–28. It’s a nice introduction to trees, bagging and forests, plus a very brief entree to the LASSO and the elastic net, and to slab and spike regression. Not enough to be able to use them, but ok if you’ve no idea what they are. It was more disappointing on boosting (completely ignoring the fact that boosting can be applied in a regression context as well as a classification context), and his comments on causality seemed curiously naive. His suggested approach involved forecasting using all variables but the one that is considered causal, and then comparing the results against what actually happened. That seems at least as likely to lead to false conclusions on causality as instrumental variables or differences-in-differences. Although Varian cites Pearl’s work approvingly, I doubt that Pearl would return the favour. On a positive note, his Bayesian Structural Time Series model (which I heard him speak about in Rome 12 months ago) seems interesting and very useful. I wonder when the promised R package will appear?

National Institute of Social Development, CASS
Address: Floor 8, China Commerce Tower, No. 5, Sanlihe Donglu,
Xicheng District, Beijing
Postal Code: 100045
Tel: (86 10) 010-65124193
Fax: (86 10) 010-68530989
Email: isd@cass.org.cn
Site map   |   link  |   Legal   |   PC record ID: Beijing ICP preparation 05009132 All Rights Reserved Copyright: Research on Chinese Social Development