Don’t Use Hadoop - Your Data Isn’t That Big
A still-very-relevant 9-year old article. Pandas has gone from strength to strength since he wrote that. In terms of expressing your computations, Hadoop is strictly inferior to SQL. There is no computation you can write in Hadoop which you cannot write more easily in either SQL, or with a simple P…