Release Notes: When triggering a job in the monitoring Web application, the panel auto-refreshes every second to get the latest state of the execution. The "Select from key/value map" transformer now supports nested select expressions like "Address.Street" or "orderlines.product.name". The table lookup mechanism have been optimized for performance, using prepared statements when running against JDBC databases. Administrators can now download file-based datastores directly from the "Datastores" page.
Release Notes: The primary bugfix in this release was about restoring the mapping of columns and specific enumerable categorizations. For instance, in the new Completeness analyzer, after reloading a saved job, the mapping was not always correct. Internal improvements have been made, making it easier to deploy the Web application in environments using the Spring Framework. The visualization settings in the desktop application have been improved by automatically taking a look at the job being visualized and toggling displayed artifacts.
Release Notes: A new Web application for DQ Monitoring, Scheduling, and centralized governance was added to the DataCleaner toolset. Results are rendered as HTML pages, and metrics can be extracted and displayed as timeline charts. A new Completeness analyzer was added for identifying incomplete filled records.
Release Notes: Support for Apache CouchDB was added (both read and write). A new writer for UPDATE TABLE operations, in addition to the existing INSERT INTO TABLE component. Drill-to-detail information is saved in result files. Improved error handling when connecting to EasyDQ Web services. Manual configuration of the table model when working with NoSQL datastores.
Release Notes: A bug has been fixed in the Table lookup transformation which caused it to be unable to have multiple output columns. CSV file escape characters have been made configurable. A minor bug pertaining to empty strings in the Concatenator has been fixed. Support for the Cubrid database has been added. Converter transformations have been adapted to be able to work on multiple fields, not just single fields.
Release Notes: This release adds saving, archiving, and sharing of data profiling results, automatic merging of duplicates (golden record creation), checking of contacts in sanction lists (due diligence checks), transformers for NoSQL data structures, specification of datastore connection properties on the commandline, drilling to details in value distribution, more user-friendly database connection configuration, and execution and scheduling of jobs via Pentaho Data Integration/Kettle.
Release Notes: Support for MongoDB databases, both for read and write operations. Integration with EasyDQ.com, which provides Customer DQ functions in the cloud. Duplicate detection (aka. Deduplication / Fuzzy matching) analyzers. A "Table lookup" component for doing lookups of multiple values from a table. An "Insert into table" component for inserting records into any kind of table (e.g. database tables, CSV files, Excel sheets, or MongoDB collections). Job-level variables which allow for parameterizable jobs that can be instrumented from the command line.
Release Notes: International data support: Transliterate transformer and Character set distribution. Pattern Finder and Value Distribution can now perform analysis based on group-by columns. Chart coloring and layout has been improved. Excel spreadsheet writing has been added to output options. Documentation improvements and command line interface support.
Release Notes: A new extension architecture allows third party extensions to register in a central marketplace and easily install onto DataCleaner. Support was added for analyzing SAS data sets and fixed width value data sets. Support was added for Japanese characters. Integrity checks were added for failing when CSV file record formats are inconsistent. The documentation was completely rewritten.