nikoport.com

Batch Mode – part 1 (“The Beginning”)

This is the very first blog post in a new series on Batch Mode. Welcome, and enjoy! I will be going into every detail of Batch Mode that I know of in the upcoming blog posts (yes, there will be a lot of them), but in this first one I decided to make more public the case I have been pushing for the last 8 months: implementing Batch Mode for RowStore.

Batch Mode

Batch Mode is a special query processing mode, which targets...

ianozsvald.com

A tiny foray into Apache Spark & Python

I’ve spent an afternoon playing with Apache Spark (1.0.1) to start forming an opinion on where it might be useful. Here are a couple of notes. We’re discussing this at PyDataLondon tonight. You can run Spark out of the box on Linux (I’m using 13.10) without having Hadoop or HDFS installed, which makes quick experimentation easy. Having downloaded spark-1.0.1-bin-hadoop2.tgz, I followed the README’s advice of...