9 Upcoming Open Source Big Data Technologies


#3. Cascading:

Available under the GNU General Public License, Cascading is an open source software abstraction layer for Hadoop. It allows users to create and execute data processing workflows on Hadoop clusters using any JVM-based language. Designed by Chris Wensel as an alternative against the complexity of MapReduce jobs, Cascading provides support to a number of sectors in ad targeting, log file analysis, bioinformatics, machine learning, predictive analytics, Web content mining and ETL applications. Wensel also developed a firm called Concurrent to provide commercialization support for Cascading.