Projects / DataCleaner / Releases

All releases of DataCleaner

  •  21 May 2014 12:51
Avatar

    Release Notes: A new major feature, duplicate detection, allows you to fuzzy find duplicate records in your data. A new analyzer for checking referential integrity between tables of multiple sources. Progress Indication has been improved and is more responsive.

    •  15 Mar 2014 03:50
    Avatar

      Release Notes: You can now compose jobs so that a DataCleaner job actually calls/invokes another "child" job as a single transformation. Source column handling was improved, and the user can now choose which columns to include in a source query. Repository file locking was implemented to prevent concurrent reads and writes.

      •  24 Sep 2013 13:16
      Avatar

        Release Notes: The 'Synonym lookup' transformation now has an option to look up every token of the input. This is useful if you're doing replacement of synonyms within the values of a long text field. A potential failure was fixed when blocking execution of DataCleaner jobs through the monitor's Web service. An improvement was made in the way jobs and the sequence of components are closed / cleaned up after execution. The Java WebStart version of DataCleaner was exposed by a bug in the Java runtime causing certain JAR files not to be recognized by the WebStart launcher under certain circumstances.

        •  05 Sep 2013 12:39
        Avatar

          Release Notes: It is now possible to hide output columns of transformations. Hiding will not affect the processing flow, but simply hide them from the user interface, potentially making the experience cleaner when interacting with other components. A new Web service has been added to the monitoring Web application which provides a way to poll the status of the execution of a particular job. A bug has been fixed which caused the HTML report to fail for certain analysis types when no records had been processed. Six other minor bugs have been addressed.

          •  12 Jun 2013 07:55
          Avatar

            Release Notes: This release adds a new filter for performing Change Data Capture, makes execution of jobs queued to avoid concurrent execution issues, and adds several minor bugfixes and improvements.

            •  01 May 2013 08:56
            Avatar

              Release Notes: A major milestone for the data quality monitoring Web application: the addition of connectivity to Salesforce and SugarCRM. Addition of wizards and other user experience improvements. Enables clustered execution of jobs. New data visualization extension and a national identifier validation extension. Adds Pentaho Data Integration job scheduling and execution.

              •  22 Jan 2013 08:39
              Avatar

                Release Notes: A Web service was added to the monitoring application for getting a (list of) metric values. The 'Table lookup' component has been improved by adding join semantics as a configurable property. The EasyDQ components have been upgraded, adding further configuration options and a richer deduplication result interface. Performance improvements have been a specific focus of this release. Improvements have been made in the engine of DataCleaner to further utilize a streaming processing approach in certain corner cases which was not covered previously.

                •  04 Jan 2013 11:59
                Avatar

                  Release Notes: The date and time related analysis options have been expanded, adding distribution analyzers for week numbers, months, and years. An optional "descriptive statistics" option has been added to the Number analyzer and the Date/time analyzer The lines in the timeline charts of the monitoring Web application now have small dots in them. Two new transformers have been added for generating UUIDs and for generating timestamps. Now ad hoc queries can contain DISTINCT clauses, *-wildcards, and subqueries, and are fault-tolerant towards text-case issues.

                  •  17 Dec 2012 18:56
                  Avatar

                    Release Notes: Data Quality KPIs can now be defined as formulas (mathematical expressions), not just raw metrics. It is now possible to fire ad-hoc SQL queries towards all datastores (DB, CSV, Excel, and more). A new analysis option, the Value matcher, was added. With this analysis, it's easy to identify unexpected values in a field. Management of jobs, including copying and deleting jobs, has been made a lot easier by exposing the functionality directly in the UI. It has been made possible to change historic data quality metrics in order to reposition results into the timeline.

                    •  31 Oct 2012 14:32
                    Avatar

                      Release Notes: Adds a service for renaming jobs in the monitoring repository. You can access this as a RESTful Web service or interactively in the UI. A Web service was added for changing the historic date of an analysis result in the monitoring repository. The Web application has been made compatible with legacy JSF containers. Caching of configuration in the Web application was greatly improved, leading to faster page load and job initialization times.

                      Screenshot

                      Project Spotlight

                      episoder

                      A tool to tell you about new episodes of your favourite TV shows.

                      Screenshot

                      Project Spotlight

                      BalanceNG

                      A modern software IP load balancer.