Talend Open Studio for Data Quality helps you profile your data. The ergonomic interface allows you to define metrics (indicators) and collect statistics on your data in a few clicks. It comes with a set of regular expressions that help you identify bad data. You can also create your own regular expressions and use them in data profiling analyses. Each indicator has many options that change its behavior so that it gives you more pertinent information. Data quality options on indicators alert you when your data quality is not what you expected.
FastFlow is a pattern-based programming framework targeting streaming applications. It implements pipeline, farm, divide and conquer, and their composition, as well as generic streaming networks. It is specifically designed to support the development and the seamless porting of existing applications to multi-core processors, GPGPUs, and clusters of them. The layered template-based C++ design ensures flexibility and extensibility. Its lock-free/fence-free run-time support minimizes cache invalidation traffic and supports the development of high-performance (high-throughput, low-latency) scalable applications. It has been shown to be comparable to or faster than TBB, OpenMP, and Cilk on several micro-benchmarks and real-world applications, especially when dealing with fine-grained parallelism and high-throughput applications.
Arcavias is an e-commerce framework for Web shops, consisting of an e-commerce library, an administration interface, and different frontends. It's a modular stack which offers flexibility and speed. Unlike other Web shops, it's not a monolithic application. Instead, you have a choice between several user frontends, and you can adapt them to your needs or write one of your own.
abiCloud is a cloud computing platform manager that allows you to easily deploy a private cloud infrastructure. One of its key features is the rich Web interface for managing the infrastructure. You can deploy a new service just by dragging and dropping a virtual machine. It allows you to deploy instances on VirtualBox, VMware, KVM, and Xen. It features user management through ACLs, infrastructure and network management, an appliance repository, and the ability to easily design virtual datacenters.
Talend Open Studio for ESB is an innovative, Eclipse-based tooling environment for modeling, configuring, and deploying integration solutions using the Apache-based open source enterprise service bus (ESB), Talend ESB. Talend ESB is a versatile and flexible ESB that allows organizations to address integration challenges, from simple departmental projects to complex, heterogeneous IT environments. Powered by the Apache CXF, Apache Camel, and Apache ActiveMQ Open Source integration projects, it makes enterprise-class integration accessible by delivering a cost-effective and easy-to-use way to integrate and expand systems and applications.
The pbs_python package is a wrapper class for the Torque Batch System C library. With this package, you can write utilities and extensions in Python instead of C. It was developed to replace xpbsmon with an ASCII version named pbsmon. The package also includes PBSQuery, a Python module built on top of the pbs Python module to simplify querying the batch server. Many examples are included in the source package.
Duplicati is a backup system and an implementation of the Linux-only Duplicity backup system. It enables you to set up and monitor backups on Windows or Linux through a very user-friendly interface. It supports backing up to a variety of destinations, such as a network folder, a USB disk, a remote FTP server, a remote SSH server, Amazon S3 storage, or a WebDAV-enabled server. All backups can be encrypted and are digitally signed, so that no other person can read or tamper with the backed-up data.
StarCluster is a utility for creating traditional computing clusters, of the kind used in research labs or for general distributed computing applications, on Amazon's Elastic Compute Cloud (EC2). It uses a simple configuration file provided by the user to request cloud resources from Amazon and to automatically configure them with a queuing system, an NFS-shared /home directory, passwordless SSH, OpenMPI, and ~140GB of scratch disk space. It consists of a Python library and a simple command line interface to the library. For end users, the command line interface provides simple, intuitive options for getting started with distributed computing on EC2 (e.g. starting/stopping clusters and managing AMIs). For developers, the library wraps the EC2 API to provide a simplified interface for launching/terminating nodes, executing commands on the nodes, copying files to/from the nodes, etc.
POP-C++ is a comprehensive object-oriented system for developing applications in large distributed computing infrastructures such as Grid, P2P, or Clouds. It consists of a programming suite (language, compiler) and a run-time system for running POP-C++ applications. The POP-C++ language is a minimal extension of C++ that implements the parallel object model with the integration of resource requirements into distributed objects. This extension is as close as possible to standard C++ so that programmers can easily learn POP-C++ and so that existing C++ libraries can be parallelized using POP-C++ without too much effort. The POP-C++ run-time is an object-oriented open design that aims at integrating different distributed computing toolkits into an infrastructure for executing requirement-driven object-oriented applications. It uses objects to serve objects: the system provides services for executing remote objects.