Distributed Data Processing

GridGain
2012.06.05
Open Source Analysis (Windows, Linux, OS X)
GridGrain offers an alternative to Hadoop’s MapReduce that is compatible with the Hadoop Distributed File System. It offers in-memory processing for fast analysis of real-time data. You can download the open source version from GitHub or purchase a commercially supported version from the link above.
http://www.gridgain.com

Hadoop
2012.06.05
Open Source Analysis (Windows, Linux, OS X)
You simply can’t talk about big data without mentioning Hadoop. The Apache distributed data processing software is so pervasive that often the terms “Hadoop” and “big data” are used synonymously. The Apache Foundation also sponsors a number of related projects that extend the capabilities of Hadoop, and many of them are mentioned below. In addition, numerous vendors offer supported versions of Hadoop and related technologies.
http://hadoop.apache.org

HPCC
2012.06.05
This issue made a man’s life miserable get levitra and didn’t allow them to lead a happy sexual life. The key features of Vital M-40 capsules: This capsule increases libido in making the lovemaking experience memorable. viagra properien appalachianmagazine.com However, if the blood supply to a portion viagra shop usa of the semen. For those of you that generally wear contacts or prescription eyeglasses on a daily purchase viagra without prescription basis, whenever a blepharitis flare-up occurs, it is typically recommended that you ditch the contacts for awhile, and just wear your glasses. Open Source Analysis (Linux)
Developed by LexisNexis Risk Solutions, HPCC is short for “high performance computing cluster.” It claims to offer superior performance to Hadoop. Both free community versions and paid enterprise versions are available.
http://hpccsystems.com

MapReduce
2012.06.05
Open Source Analysis Originally developed by Google, the MapReduce website describe it as “a programming model and software framework for writing applications that rapidly process vast amounts of data in parallel on large clusters of compute nodes.” It’s used by Hadoop, as well as many other data processing applications.
http://hadoop.apache.org/mapreduce

Storm
2012.06.05
Open Source Analysis (Linux)
Now owned by Twitter, Storm offers distributed real-time computation capabilities and is often described as the “Hadoop of realtime.” It’s highly scalable, robust, fault-tolerant and works with nearly all programming languages.
https://github.com/nathanmarz/storm#readme

This entry was posted in Programming. Bookmark the permalink.