Hadoop HBase ingestion of Microsoft Exchange |
The Esri Geometry API for Java enables developers to write custom applications for analysis of spatial data. This API is used in the Esri GIS Tools for Hadoop and other third-party data processing solutions. |
Splout SQL "starter" example for splout-hadoop Java API - log analysis using Cascading & serving using Splout SQL |
Talk on 'Getting Started with Hadoop' |
Hadoop MapReduce Apache log analyzer |
Compiling Taverna Workflows to native Hadoop Programs |
Simple Hadoop app using the HadoopBAM library |
A Heritrix 3 writer processor for storing crawled data in the Hadoop Distributed File System |
Distributed database specialized in exporting key/value data from Hadoop |
This toolkit consists of implementations of various graph-based semi-supervised learning (SSL) algorithms. Currently, three algorithms are implemented: Gaussian Random Fields (GRF), Adsorption, and Modified Adsorption (MAD). Junto also contains Hadoop-based implementations of these three algorithms. |
The GIS Tools for Hadoop are a collection of GIS tools for spatial analysis of big data. |
The Spatial Framework for Hadoop allows developers and data scientists to use the Hadoop data processing system for spatial data analysis. |
HiTune is a Hadoop performance analyzer. |
An example project for running a MapReduce job on Hadoop, with input from Riak |
A patched Jetty 6.1.26 for use in Hadoop |
A bunch of utility classes for Java, Hadoop, HBase, Pig, etc. |
Common metadata layer for Hadoop's MapReduce, Pig, and Hive |
Hadoop MapReduce article I wrote for Java Tech Journal |
From Hadoop: The Definitive Guide and online tutorials |
Mirror of Apache Hadoop ZooKeeper |
A means of interfacing the Curator with Hadoop for parallel batch processing |
Java client code for the Hadoop WebHDFS REST API, with Kerberos authentication. |
Mavuno: A Hadoop-Based Text Mining Toolkit |
Scripting Languages on Hadoop: Jaql vs. Pig Latin (MapReduce stuff) |
A very simple example of using Hadoop's MapReduce functionality in Java. |
A re-architecting of the Hadoop web applications, open for all to contribute. |
Hadoop-ec2 tool written in Java. |
An implementation of a real-world map-reduce workflow in each major framework. |
AppScale is an open-source hybrid cloud platform. AppScale implements a number of popular APIs including those of Google App Engine, MapReduce (via Hadoop), MPI and others. AppScale executes as ... |
HBase-util is an open source module for storing bean classes directly in HBase tables (running on the Hadoop Distributed File System (this project contributed apache hbase(This i... |
This is a multi-class SVM classifier on Hadoop for large-scale data sets. |