Jenny Heffernan

Using Apache Spark for Data Processing: Lessons Learned

Posted on Thursday, June 28, 2018 - 11:54

As a Data Scientist at Acquia, I get to build machine learning models that solve problems or speed up tasks that are time-consuming for humans. This means I spend a lot of time getting data into a format that machine learning models can use, or just putting it into a useful form for exploratory analysis.

One of the tools I use for handling large amounts of data and getting it into the required format is Apache Spark.
