Spark has tremendous value when it comes to aggregation using its flexible DataFrame API. It enables you to execute performant jobs in a distributed fashion. A DataFrame is nothing but a Dataset organized into named columns which makes the concept similar to the data frames...

Integrating Apache Spark with H2O is surprisingly simple when you know what you are doing. In this post, we'll cover multiple data transformations with Spark. The data will then be fed into the H2O AutoML algorithm. Finally, we'll use the best H2O model to make a prediction....

I spent the past 5 years working on Lean and Six Sigma improvement projects. I was lucky enough to work in both manufacturing and service delivery problems at global scale. I always had my focus on reducing average handling time, improving accuracy and delivering year...