Talend Open Studio for Data Quality helps you profile your data. The ergonomic interface allows you to define metrics (indicators) and collect statistics on your data in a few clicks. It comes with a set of regular expressions that help you identify bad data. You can create your own regular expressions and use them in data profiling analyses. Each indicator has many options that change its behavior so that it gives you more pertinent information. Data quality options on indicators alert you when your data quality is not what you expected.
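The idea of a regular-expression indicator with a data quality alert can be sketched in a few lines of Python. This is only an illustration of the concept, not Talend's implementation or API: the `profile_column` function, the `EMAIL_PATTERN` expression, and the `min_match_ratio` threshold are all hypothetical names and values chosen for the example.

```python
import re

# Hypothetical pattern for illustration; real profiling rules are usually stricter.
EMAIL_PATTERN = re.compile(r"[^@\s]+@[^@\s]+\.[^@\s]+")


def profile_column(values, pattern, min_match_ratio=0.9):
    """Count how many values match a pattern and flag low-quality columns.

    min_match_ratio is an assumed threshold: the alert is raised when the
    share of matching values falls below it.
    """
    bad_values = [
        v for v in values
        if v is None or pattern.fullmatch(v) is None
    ]
    total = len(values)
    matched = total - len(bad_values)
    # An empty column has nothing wrong with it, so treat it as fully matching.
    ratio = matched / total if total else 1.0
    return {
        "total": total,
        "matched": matched,
        "bad_values": bad_values,
        "match_ratio": ratio,
        "alert": ratio < min_match_ratio,
    }


if __name__ == "__main__":
    column = ["a@example.com", "bad", "b@example.org", None]
    print(profile_column(column, EMAIL_PATTERN))
```

The returned dictionary plays the role of an indicator result: the counts are the collected statistics, and the `alert` flag corresponds to a threshold that was not met.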
Arcavias is an e-commerce framework for Web shops, consisting of an e-commerce library, an administration interface, and different frontends. It's a modular stack which offers flexibility and speed. Unlike other Web shops, it's not a monolithic application. Instead, you have a choice between several user frontends, and you can adapt them to your needs or write one of your own.
FastFlow is a pattern-based programming framework targeting streaming applications. It implements pipeline, farm, and divide-and-conquer patterns and their composition, as well as generic streaming networks. It is specifically designed to support the development of new applications and the seamless porting of existing ones to multi-core platforms. The layered, template-based C++ design ensures flexibility and extensibility. Its lock-free/fence-free run-time support minimizes cache invalidation traffic and supports the development of high-performance (high-throughput, low-latency) scalable applications. It has been shown to be faster than TBB, OpenMP, and Cilk on several micro-benchmarks and real-world applications, especially when dealing with fine-grained parallelism and high-throughput applications.
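To make the pipeline and farm patterns concrete, here is a minimal Python sketch of both. It is not FastFlow code and does not reflect its C++ API: the names `run_pipeline` and `run_farm` are hypothetical, and the sketch uses Python's lock-based `queue.Queue` and a thread pool, whereas FastFlow relies on lock-free queues. It only shows how the patterns are shaped.

```python
import queue
import threading
from concurrent.futures import ThreadPoolExecutor

_STOP = object()  # sentinel marking the end of the stream


def _stage(fn, inbox, outbox):
    """Run one pipeline stage: apply fn to each item until the sentinel arrives."""
    while True:
        item = inbox.get()
        if item is _STOP:
            outbox.put(_STOP)  # forward the sentinel to the next stage
            return
        outbox.put(fn(item))


def run_pipeline(items, fns):
    """Pipeline: one thread per stage, connected by FIFO queues."""
    queues = [queue.Queue() for _ in range(len(fns) + 1)]
    threads = [
        threading.Thread(target=_stage, args=(fn, queues[i], queues[i + 1]))
        for i, fn in enumerate(fns)
    ]
    for t in threads:
        t.start()
    for item in items:
        queues[0].put(item)
    queues[0].put(_STOP)
    results = []
    while True:
        out = queues[-1].get()
        if out is _STOP:
            break
        results.append(out)
    for t in threads:
        t.join()
    return results


def run_farm(items, worker, n_workers=4):
    """Farm: distribute independent items to a pool of identical workers."""
    with ThreadPoolExecutor(max_workers=n_workers) as pool:
        # map() collects results in input order, acting as the collector.
        return list(pool.map(worker, items))


if __name__ == "__main__":
    print(run_pipeline([1, 2, 3], [lambda x: x + 1, lambda x: x * 2]))
    print(run_farm([1, 2, 3, 4], lambda x: x * x))
```

In the pipeline, different stages work on different items at the same time; in the farm, the same function is applied to many items in parallel. Composing the two (for example, a farm used as one pipeline stage) is the kind of composition the description refers to.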
Kolab is a groupware system that provides a comprehensive and flexible solution for workgroups. Email, contacts, and group calendars can be managed via the Web, Windows/Outlook, and Linux/KDE. It supports offline synchronization and works seamlessly in mixed client environments because the Kolab-XML storage format is fully open and avoids MAPI/TNEF.
Talend Open Studio for ESB is an innovative, Eclipse-based tooling environment for modeling, configuring, and deploying integration solutions using the Apache-based open source enterprise service bus (ESB), Talend ESB. Talend ESB is a versatile and flexible ESB that allows organizations to address integration challenges, from simple departmental projects to complex, heterogeneous IT environments. Powered by the Apache CXF, Apache Camel, and Apache ActiveMQ Open Source integration projects, it makes enterprise-class integration accessible by delivering a cost-effective and easy-to-use way to integrate and expand systems and applications.
abiCloud is a cloud computing platform manager that allows you to easily deploy a private cloud infrastructure. One of its key features is the rich Web interface for managing the infrastructure. You can deploy a new service just by dragging and dropping a virtual machine. It allows you to deploy instances over VirtualBox, VMware, KVM, and Xen. It features user management through ACLs, infrastructure and network management, an appliance repository, and the ability to easily design virtual datacenters.
The pbs_python package is a wrapper class for the Torque Batch System C library. With this package, you can now write utilities and extensions in Python instead of C. This package was developed to replace xpbsmon with an ASCII version named pbsmon. The package also includes PBSQuery, a Python module built on top of the pbs Python module to simplify querying the batch server. Many examples are included in the source package.
Duplicati is a backup system and an implementation of Duplicity, the Linux-only backup system on which it is based. Duplicati has a very user-friendly interface that allows you to set up, configure, and monitor backups on Windows or Linux. It supports backing up to a variety of destinations, such as a network folder, a USB disk, a remote FTP server, a remote SSH server, Amazon S3 storage, or a WebDAV-enabled server. All backups can be encrypted and are digitally signed, so that no other person can read or tamper with the backed-up data.
StarCluster is a utility for creating traditional computing clusters used in research labs or for general distributed computing applications on Amazon's Elastic Compute Cloud (EC2). It uses a simple configuration file provided by the user to request cloud resources from Amazon and to automatically configure them with a queuing system, an NFS-shared /home directory, passwordless SSH, OpenMPI, and ~140 GB of scratch disk space. It consists of a Python library and a simple command-line interface to the library. For end users, the command-line interface provides simple, intuitive options for getting started with distributed computing on EC2 (e.g. starting/stopping clusters, managing AMIs, etc.). For developers, the library wraps the EC2 API to provide a simplified interface for launching/terminating nodes, executing commands on the nodes, copying files to/from the nodes, etc.