In the Depths of the Cloud, Open Source and Proprietary Leviathans Fight to the Death
Jono Bacon Asked Google Home ‘Who Founded Linux?’ You Won’t Believe What Happened Next!
Red Hat's Women in Open Source Award Winners, 2017
Imagine an Android Phone Without Linux Inside
Linus Torvalds Talks to Debian Users
Mozilla Relents, Thunderbird Can Stay
Heed the Prophet Stallman, oh Software Sinners!
May 13th, 2016

Government Analytics Forum: Handling Big Data With Apache Spark

The Video Screening Room

If you’re like us, your eyes glaze over whenever the subject of big data or Hadoop comes up. Watch this video and you’ll be able to join the conversation the next time the subject is broached.

When you’re talking big data analysis, you’re almost always talking open source. Apache Hadoop is what often comes to mind as a valuable big data analysis tool. But do you know the advantages that Apache Spark has to offer? This May 5 presentation from IBM’s Government Analytics Forum in Washington, DC does a nice job of explaining the advantages.

My takeaway from this video? Apache Spark has large speed advantages over Apache Hadoop — and speed of data analysis is often vital. Version 2.0 of Spark is due out later this year. What is IBM’s interest in Apache Spark? It runs on IBM Bluemix.

For the past 10 years, Phil has been working at a public library in the Washington D.C.-area, helping youth and adults use the 28 public Linux stations the library offers seven days a week. He also writes for MAKE magazine, and TechSoup Libraries. Suggest videos by contacting Phil on Twitter or at

Comments are closed.