Release Notes: This release adds a job cost accounting function.
Release Notes: Haruhi now writes a tag with the Hadoop job ID to all line items for the job, so this release can add up line items with this tag to calculate the cost of a job after the fact. When running a flow (multiple jobs), Haruhi now uses the command line arguments of the flow to determine the name of the flow.
Release Notes: Tuning job parameters has sped up the weekly flow from 2.5 hours to about 57 minutes with a small cost reduction. A job to smush objects has been created, so it is now possible to import Dbpedia PageLinks into the :BaseKB space.
Release Notes: This adds the "sumRDF" tool, which sums up RDF values and is necessary for the conversion of DBpedia-derived subjective importance scores to :BaseKB-compatible scores.
Release Notes: This release adds the "smushSubject" tool, which can change the vocabulary used in the subject field using a reduce-side join.
Release Notes: This release moves the "SelfAwareTool" from the telepath project into infovore. This component automatically configures a Hadoop job based on introspection of the environment of the Tool object.
Release Notes: This release adds a rollback to guava 11.0.2 to avoid "dependency hell" problems with Hadoop. There are also improvements, generalization, and cleanups to support reduce-side join handling in telepath.
Release Notes: This release updates to Centipede 99.3, Spring 4, and Guava 16. If you hate XML, you can now configure Infovore with Groovy.
Release Notes: In this release, Haruhi can create a persistent cluster in AWS that does not terminate when it runs out of steps. Haruhi can now submit jobs to a running persistent cluster, which reduces the turnaround in testing small jobs from about ten minutes to two minutes.
Release Notes: This release includes a foreach loop for Haruhi flows, which makes it easy to run a number of similar jobs in sequence. This release also includes a number of refactorings and other changes to support the creation of projects like telepath, which derive from bakemono.