DataCleaner is a data quality analysis tool that allows you to perform data profiling, validation, and minor ETL-like tasks. These activities help you administer and monitor your data quality in order to ensure that your data is useful and applicable to your business situation. It can be used for master data management (MDM) methodologies, data warehousing projects, statistical research, preparation for extract-transform-load activities, and more.
Piggydb is a flexible and scalable knowledge building platform that supports a heuristic or bottom-up approach to discover new concepts or ideas based on your input. You can begin with using it as a flexible outliner, diary or notebook, and as your database grows, Piggydb helps you to shape or elaborate your own knowledge. Piggydb is a Web application provided as a self-contained package that contains a Web server and database engine.
Talend Open Studio for Data Quality helps you to profile your data. The ergonomic interface allows you to define metrics (indicators) and collect statistics on your data in a few clicks. It comes with a set of regular expressions that help you to identify bad data. You can create your own regular expressions and use them in data profiling analyses. Each indicator has many options that change its behavior so that it gives you more pertinent information. Data quality options on indicators alert you when your data quality is not what you expected.
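To illustrate the general idea of regular-expression indicators in data profiling, here is a minimal Python sketch. It is not Talend's implementation or API; the function name profile_column, the EMAIL_PATTERN expression, and the sample values are all hypothetical and chosen only for illustration.

```python
import re

# Hypothetical pattern: a deliberately loose check for "something@domain.tld".
EMAIL_PATTERN = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")


def profile_column(values, pattern):
    """Return simple indicators (counts) for one column of string values."""
    total = len(values)
    # Treat None and the empty string as missing values.
    nulls = sum(1 for v in values if v is None or v == "")
    # A value matches only if the whole string fits the pattern.
    matching = sum(1 for v in values if v and pattern.fullmatch(v))
    return {
        "row_count": total,
        "null_count": nulls,
        "match_count": matching,
        "non_match_count": total - nulls - matching,
    }


# Hypothetical sample column with two missing values and one bad address.
emails = ["ann@example.com", "bob@example", "", None, "carol@example.org"]
report = profile_column(emails, EMAIL_PATTERN)
print(report)
```

A real profiling tool would compute such indicators against a database column and compare them with thresholds to raise alerts; this sketch only shows the counting step.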
The TMHarvest library is based on TM4J and provides a convenient way to automatically generate topic maps from different data sources. A rules file with embedded templates (written in XML) defines which data sources topic map constructs should be taken from, as well as how new or existing topics should be associated.
EZ Reusable Objects (EZRO) is a Web application that can be used by non-technical staff to manage content as "objects." Content objects containing text, video, and audio can be shared, modified, and re-styled to appear via a traditional Web site, an on-line course, an innovative "Coach," or as a community of interest site. It is highly scalable and can be used for public Web sites, secure environments, and private intra/extranets.
MKSearch is a metadata search engine that indexes structured metadata in Web documents instead of free text in the document body. The data acquisition system conforms to the Dublin Core metadata in HTML recommendations, and supports other application profiles, such as the UK e-Government Metadata Standard. It also indexes native RDF formats, including RSS 1.0. The system has five major components: a Web crawler, an HTML document validator and formatter, a set of custom indexers, an RDF storage and query system, and a public query interface, provided through a standard servlet container.
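As a rough illustration of indexing structured metadata rather than body text, the following Python sketch collects Dublin Core meta elements from an HTML document using only the standard library. It is not MKSearch code; the class DublinCoreParser, the function extract_dublin_core, and the sample page are hypothetical, and the sketch assumes the common convention of meta names prefixed with "DC.".

```python
from html.parser import HTMLParser


class DublinCoreParser(HTMLParser):
    """Collect <meta name="DC.*" content="..."> pairs; ignore everything else."""

    def __init__(self):
        super().__init__()
        self.metadata = {}

    def handle_starttag(self, tag, attrs):
        if tag != "meta":
            return
        attr = dict(attrs)
        name = attr.get("name") or ""
        content = attr.get("content")
        # Assumption: Dublin Core elements are marked with a "DC." name prefix.
        if name.lower().startswith("dc.") and content is not None:
            # An element such as DC.creator may legitimately repeat.
            self.metadata.setdefault(name, []).append(content)


def extract_dublin_core(html):
    parser = DublinCoreParser()
    parser.feed(html)
    parser.close()
    return parser.metadata


# Hypothetical sample document.
SAMPLE = """<html><head>
<link rel="schema.DC" href="http://purl.org/dc/elements/1.1/">
<meta name="DC.title" content="Example page">
<meta name="DC.creator" content="A. Author">
<meta name="DC.creator" content="B. Author">
<meta name="description" content="ignored">
</head><body><p>Body text is not indexed.</p></body></html>"""

dc = extract_dublin_core(SAMPLE)
print(dc)
```

The sketch covers only the extraction step; crawling, validation, RDF storage, and querying are separate components in the system described above.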
PIVIAU is a PHP/MySQL Web-based gallery for pictures, videos, and audio that supports tags and EXIF. Albums and pictures can be searched by tag, country, city, place, date, subject, and author. An RSS feed is available for every search you can do, and for recent pictures. There is an AJAX-powered slideshow. Pictures are resized using PHP GD image functions. Videos are viewable a la Google Video using a Flash video player. Audio (WAV and MP3) is supported via the embed HTML tag. The picture date is extracted from EXIF data using the PHP exif functions.
Lobotomy comprises many sub-projects that experiment with new designs for human-computer interaction and, more generally, a new approach to home computing. It includes a relational filesystem, a window manager, and many libraries, tools, and daemons to automatically extract and handle metadata.
Project35 is an application suite that allows users to generate data entry forms from an XML schema. Application designers use a Configuration Tool to associate records and record fields defined in the schema with application properties, which include features such as validation services, controlled vocabulary services, general plugins, and various aspects of look-and-feel.
The xattr command lists, writes (sets), prints (displays), and deletes the extended attributes of file system objects. An extended attribute is a name:data (key:value) pairing arbitrarily associated with a file system object (e.g., a file, directory, or symbolic link). The name of an extended attribute may be any null-terminated UTF-8 string. The data associated with it may be either textual or binary. This implementation of the xattr command is designed to be backwards-compatible with the Python script that shipped as /usr/bin/xattr in Mac OS X 10.5.0 (Leopard).
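The four operations named above (set, list, print, delete) can be sketched with Python's standard os module. This is not the xattr command itself: the os.setxattr family is available only on Linux, where user-defined attribute names need the "user." prefix, and the file system must support extended attributes. The helper roundtrip_xattr and the attribute name user.comment are hypothetical.

```python
import os
import tempfile


def roundtrip_xattr(path, name, data):
    """Set, list, read, and delete one extended attribute.

    Returns the bytes read back, or None when the platform or the
    file system does not support extended attributes.
    """
    if not hasattr(os, "setxattr"):  # the os.*xattr functions are Linux-only
        return None
    try:
        os.setxattr(path, name, data)    # write (set)
        names = os.listxattr(path)       # list
        if name not in names:
            return None
        value = os.getxattr(path, name)  # print (display)
        os.removexattr(path, name)       # delete
        return value
    except OSError:
        # Raised, for example, when the file system lacks xattr support.
        return None


# Hypothetical attribute name and value on a throwaway temporary file.
with tempfile.NamedTemporaryFile() as tmp:
    result = roundtrip_xattr(tmp.name, "user.comment", b"hello")

print(result)
```

The data is passed as bytes, which reflects the point that an attribute's value may be textual or binary.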