Release Notes: A new web application for DQ monitoring, scheduling, and centralized governance was added to the DataCleaner toolset. Results are rendered as HTML pages, and metrics can be extracted and displayed as timeline charts. A new Completeness analyzer was added for identifying incompletely filled records.
Release Notes: Support for Apache CouchDB was added (both read and write). A new writer for UPDATE TABLE operations was added, in addition to the existing INSERT INTO TABLE component. Drill-to-detail information is now saved in result files. Error handling when connecting to EasyDQ Web services was improved. The table model can now be configured manually when working with NoSQL datastores.
Release Notes: A bug in the Table lookup transformation that prevented it from having multiple output columns has been fixed. CSV file escape characters have been made configurable. A minor bug pertaining to empty strings in the Concatenator has been fixed. Support for the Cubrid database has been added. Converter transformations have been adapted to work on multiple fields, not just single fields.
Release Notes: This release adds saving, archiving, and sharing of data profiling results, automatic merging of duplicates (golden record creation), checking of contacts against sanction lists (due diligence checks), transformers for NoSQL data structures, specification of datastore connection properties on the command line, drilling to details in value distribution, more user-friendly database connection configuration, and execution and scheduling of jobs via Pentaho Data Integration/Kettle.
Release Notes: Support for MongoDB databases, both for read and write operations. Integration with EasyDQ.com, which provides Customer DQ functions in the cloud. Duplicate detection (also known as deduplication or fuzzy matching) analyzers. A "Table lookup" component for doing lookups of multiple values from a table. An "Insert into table" component for inserting records into any kind of table (e.g. database tables, CSV files, Excel sheets, or MongoDB collections). Job-level variables, which allow for parameterizable jobs that can be instrumented from the command line.
Release Notes: International data support: Transliterate transformer and Character set distribution. Pattern Finder and Value Distribution can now perform analysis based on group-by columns. Chart coloring and layout have been improved. Excel spreadsheet writing has been added to the output options. Documentation improvements and command line interface support.
Release Notes: A new extension architecture allows third-party extensions to be registered in a central marketplace and easily installed into DataCleaner. Support was added for analyzing SAS data sets and fixed-width value data sets. Support was added for Japanese characters. Integrity checks were added that fail when CSV file record formats are inconsistent. The documentation was completely rewritten.
Release Notes: Quick filtering of datastores was added. Reference data for countries is now provided. Minor UI improvements were made. Support for extension packages was added. A command line interface for executing jobs was added. Number formatting options were added to the "Convert to Number" transformer.
Release Notes: Window management was simplified by making most operations available through the single job builder window. Jobs can now be stopped before they have finished. Bar and line charts have been added to many analyzer results. Preview data now contains paging controls to browse further into the data. The most common database drivers are now included by default. Various minor improvements and bugfixes were made.