Release Notes: A major milestone for the data quality monitoring Web application: the addition of connectivity to Salesforce and SugarCRM. Addition of wizards and other user experience improvements. Enables clustered execution of jobs. New data visualization extension and a national identifier validation extension. Adds Pentaho Data Integration job scheduling and execution.
Release Notes: A new Web application for DQ Monitoring, Scheduling, and centralized governance was added to the DataCleaner toolset. Results are rendered as HTML pages, and metrics can be extracted and displayed as timeline charts. A new Completeness analyzer was added for identifying incomplete filled records.
Release Notes: This release adds saving, archiving, and sharing of data profiling results, automatic merging of duplicates (golden record creation), checking of contacts in sanction lists (due diligence checks), transformers for NoSQL data structures, specification of datastore connection properties on the commandline, drilling to details in value distribution, more user-friendly database connection configuration, and execution and scheduling of jobs via Pentaho Data Integration/Kettle.
Release Notes: Support for MongoDB databases, both for read and write operations. Integration with EasyDQ.com, which provides Customer DQ functions in the cloud. Duplicate detection (aka. Deduplication / Fuzzy matching) analyzers. A "Table lookup" component for doing lookups of multiple values from a table. An "Insert into table" component for inserting records into any kind of table (e.g. database tables, CSV files, Excel sheets, or MongoDB collections). Job-level variables which allow for parameterizable jobs that can be instrumented from the command line.
Release Notes: International data support: Transliterate transformer and Character set distribution. Pattern Finder and Value Distribution can now perform analysis based on group-by columns. Chart coloring and layout has been improved. Excel spreadsheet writing has been added to output options. Documentation improvements and command line interface support.
Release Notes: A new extension architecture allows third party extensions to register in a central marketplace and easily install onto DataCleaner. Support was added for analyzing SAS data sets and fixed width value data sets. Support was added for Japanese characters. Integrity checks were added for failing when CSV file record formats are inconsistent. The documentation was completely rewritten.
Release Notes: Window management was simplified by making most operations available through the single job builder window. Jobs are now stoppable before they have finished. Bar and line charts have been added to a lot of analyzer results. Preview data now contains paging controls to browse further into the data. Most common database drivers are included by default. Various minor improvements and bugfixes were made.
Release Notes: This release adds multi-threaded execution, a commandline interface (runjob.sh/runjob.cmd), some UI updates, and a few bugfixes.
Release Notes: The new online RegexSwap system has been integrated to support browsing and downloading of regexes. Automatic download and installation of popular database drivers. Templates for JDBC connection strings. Profiling and validation results now include detail execution status and monitoring capabilities. Better database and XML file compatibility due to updated MetaModel libraries.
Release Notes: A major update was made to functionality, with lots of new features that were built upon the stabilization release of 1.4. The license was changed to LGPL. New profiles were added for a date mask matcher and a regex matcher. More file types are supported (.dat and .txt). XML file support was improved.