Projects / DataCleaner / Releases

All releases of DataCleaner

  •  12 Oct 2012 12:32
Avatar

    Release Notes: When triggering a job in the monitoring Web application, the panel auto-refreshes every second to get the latest state of the execution. The "Select from key/value map" transformer now supports nested select expressions like "Address.Street" or "orderlines[0].product.name". The table lookup mechanism have been optimized for performance, using prepared statements when running against JDBC databases. Administrators can now download file-based datastores directly from the "Datastores" page.

    •  01 Oct 2012 09:18
    Avatar

      Release Notes: The primary bugfix in this release was about restoring the mapping of columns and specific enumerable categorizations. For instance, in the new Completeness analyzer, after reloading a saved job, the mapping was not always correct. Internal improvements have been made, making it easier to deploy the Web application in environments using the Spring Framework. The visualization settings in the desktop application have been improved by automatically taking a look at the job being visualized and toggling displayed artifacts.

      •  20 Sep 2012 13:39
      Avatar

        Release Notes: A new Web application for DQ Monitoring, Scheduling, and centralized governance was added to the DataCleaner toolset. Results are rendered as HTML pages, and metrics can be extracted and displayed as timeline charts. A new Completeness analyzer was added for identifying incomplete filled records.

        •  30 Apr 2012 14:54
        Avatar

          Release Notes: Support for Apache CouchDB was added (both read and write). A new writer for UPDATE TABLE operations, in addition to the existing INSERT INTO TABLE component. Drill-to-detail information is saved in result files. Improved error handling when connecting to EasyDQ Web services. Manual configuration of the table model when working with NoSQL datastores.

          •  10 Apr 2012 14:00
          Avatar

            Release Notes: A bug has been fixed in the Table lookup transformation which caused it to be unable to have multiple output columns. CSV file escape characters have been made configurable. A minor bug pertaining to empty strings in the Concatenator has been fixed. Support for the Cubrid database has been added. Converter transformations have been adapted to be able to work on multiple fields, not just single fields.

            •  28 Mar 2012 08:42
            Avatar

              Release Notes: This release adds saving, archiving, and sharing of data profiling results, automatic merging of duplicates (golden record creation), checking of contacts in sanction lists (due diligence checks), transformers for NoSQL data structures, specification of datastore connection properties on the commandline, drilling to details in value distribution, more user-friendly database connection configuration, and execution and scheduling of jobs via Pentaho Data Integration/Kettle.

              •  02 Jan 2012 18:52
              Avatar

                Release Notes: This release adds minor bugfixes, performance improvements, and a few new features. Among the important ones are greatly-improved batch loading performance, a convenient "write data" menu in the main window, double-click renaming of job components, syntax coloring in the Javascript transformer and filter, and fixes for a potential deadlock when starting the application.

                •  14 Dec 2011 12:23
                Avatar

                  Release Notes: Support for MongoDB databases, both for read and write operations. Integration with EasyDQ.com, which provides Customer DQ functions in the cloud. Duplicate detection (aka. Deduplication / Fuzzy matching) analyzers. A "Table lookup" component for doing lookups of multiple values from a table. An "Insert into table" component for inserting records into any kind of table (e.g. database tables, CSV files, Excel sheets, or MongoDB collections). Job-level variables which allow for parameterizable jobs that can be instrumented from the command line.

                  •  29 Sep 2011 13:25
                  Avatar

                    Release Notes: International data support: Transliterate transformer and Character set distribution. Pattern Finder and Value Distribution can now perform analysis based on group-by columns. Chart coloring and layout has been improved. Excel spreadsheet writing has been added to output options. Documentation improvements and command line interface support.

                    •  27 Jun 2011 10:40
                    Avatar

                      Release Notes: A new extension architecture allows third party extensions to register in a central marketplace and easily install onto DataCleaner. Support was added for analyzing SAS data sets and fixed width value data sets. Support was added for Japanese characters. Integrity checks were added for failing when CSV file record formats are inconsistent. The documentation was completely rewritten.

                      Screenshot

                      Project Spotlight

                      episoder

                      A tool to tell you about new episodes of your favourite TV shows.

                      Screenshot

                      Project Spotlight

                      BalanceNG

                      A modern software IP load balancer.