Interesting. This might invalidate their results. Perhaps 1000 nodes give MapReduce an advantage on a smaller dataset? 1000 nodes is not that large a number these days. And you get 10x the disk bandwidth, which explains the speed of loading.
Plus there is the "more challenging" comment above. The conclusion is far from clear.