Version 2.4.1 of DataCleaner

Release Notes: This release adds minor bugfixes, performance improvements, and a few new features. The most important are greatly improved batch loading performance, a convenient "write data" menu in the main window, double-click renaming of job components, syntax coloring in the JavaScript transformer and filter, and fixes for a potential deadlock when starting the application.

Other releases

Release Notes: You can now compose jobs so that a DataCleaner job invokes another "child" job as a single transformation. Source column handling was improved, and the user can now choose which columns to include in a source query. Repository file locking was implemented to prevent concurrent reads and writes.

  •  24 Sep 2013 23:52

Release Notes: The 'Synonym lookup' transformation now has an option to look up every token of the input. This is useful if you're replacing synonyms within the values of a long text field. A potential failure was fixed when blocking execution of DataCleaner jobs through the monitor's Web service. The way jobs and their sequence of components are closed and cleaned up after execution was improved. The Java WebStart version of DataCleaner was affected by a bug in the Java runtime that caused certain JAR files not to be recognized by the WebStart launcher under certain circumstances.
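To illustrate the difference between looking up a whole value and looking up every token, here is a minimal Python sketch. It is not DataCleaner's implementation: the function name `replace_synonyms`, the whitespace-based tokenization, and the sample synonym table are all assumptions made for illustration.

```python
import re


def replace_synonyms(text, synonyms):
    """Replace each whitespace-delimited token that has an entry in `synonyms`.

    Hypothetical helper for illustration only; tokenizing on whitespace
    is an assumption, not DataCleaner's documented behavior.
    """
    # Split on whitespace but keep the separators so the original spacing survives.
    parts = re.split(r"(\s+)", text)
    return "".join(synonyms.get(part, part) for part in parts)


# Hypothetical synonym table: variant -> master term.
synonyms = {"St.": "Street", "Rd.": "Road"}

address = "12 Main St. near Park Rd."

# Whole-value lookup finds nothing, because the full string is not a key.
whole_value = synonyms.get(address, address)

# Token-wise lookup replaces the synonyms embedded in the longer text.
per_token = replace_synonyms(address, synonyms)
```

With the whole-value lookup the address comes back unchanged, while the token-wise lookup rewrites the abbreviations inside it, which is the long-text-field case the release note describes.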

Release Notes: It is now possible to hide output columns of transformations. Hiding does not affect the processing flow; it only removes the columns from the user interface, which can make interacting with other components cleaner. A new Web service has been added to the monitoring Web application which provides a way to poll the status of the execution of a particular job. A bug has been fixed which caused the HTML report to fail for certain analysis types when no records had been processed. Six other minor bugs have been addressed.

Release Notes: This release adds a new filter for performing Change Data Capture, queues job execution to avoid concurrent execution issues, and includes several minor bugfixes and improvements.

Release Notes: This release is a major milestone for the data quality monitoring Web application. It adds connectivity to Salesforce and SugarCRM, wizards and other user experience improvements, clustered execution of jobs, a new data visualization extension, a national identifier validation extension, and Pentaho Data Integration job scheduling and execution.

