7 projects tagged "CUDA"

No download No website Updated 02 Jun 2009 DCGN

Pop 13.89
Vit 38.07

DCGN is a point-to-point and collective communication library written for clusters of GPUs. DCGN allows one to issue sends, receives, and collective communications via CUDA kernels. The library handles the message passing, allowing the user to focus solely on the algorithm.

Download No website Updated 15 Feb 2011 Pyrit

Pop 125.38
Vit 2.60

Pyrit advances the state of attacks on WPA-PSK and WPA2-PSK, the protocols that protect today's public WiFi airspace. Pyrit lets you build massive databases that precompute part of the WPA/WPA2-PSK authentication phase, trading storage space for computation time. The performance gain for real-world attacks is in the range of three orders of magnitude, which calls for a reconsideration of the protocol's security. It exploits the computational power of multi-core CPUs and other platforms through ATI-Stream, Nvidia CUDA, OpenCL, and VIA Padlock. It is a powerful attack against one of the world's most widely used security protocols.
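The space-time tradeoff described above can be illustrated with a minimal sketch. It assumes the precomputed part is the WPA/WPA2-PSK Pairwise Master Key, which the 802.11i standard derives from the passphrase with PBKDF2-HMAC-SHA1, using the SSID as salt, 4096 iterations, and a 32-byte output. The function names `wpa_pmk` and `precompute_table` are hypothetical, and this is not Pyrit's own code or storage format.

```python
import hashlib

def wpa_pmk(passphrase: str, ssid: str) -> bytes:
    """Derive the 32-byte Pairwise Master Key for one passphrase/SSID pair."""
    # PBKDF2-HMAC-SHA1, SSID as salt, 4096 iterations, 32-byte key.
    return hashlib.pbkdf2_hmac(
        "sha1", passphrase.encode("utf-8"), ssid.encode("utf-8"), 4096, 32
    )

def precompute_table(ssid: str, passphrases):
    """Hypothetical helper: compute the expensive step once and keep the results.

    The table is only valid for the given SSID, because the SSID is the salt.
    """
    return {p: wpa_pmk(p, ssid) for p in passphrases}

if __name__ == "__main__":
    table = precompute_table("ExampleNet", ["password", "letmein123"])
    for passphrase, pmk in table.items():
        print(passphrase, pmk.hex())
```

Because the SSID salts the derivation, a table built this way has to be rebuilt for every network name, which is why the stored databases grow large.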

Download Website Updated 15 Jul 2010 Jacket by AccelerEyes

Pop 27.24
Vit 1.49

The Jacket platform consists of a runtime and a language processing system that automatically optimize existing applications or new algorithms for GPU computing. Jacket currently supports the MATLAB language as a frontend to the platform. Jacket's language processing system automatically translates MATLAB code into the high-performance primitives required to make the best use of Nvidia CUDA-capable GPUs. Working in concert with the translation system, Jacket's runtime system optimizes memory transfers, compiles code on the fly for performance tuned at run time, and launches GPU kernels efficiently. All GPU-specific programming details are handled by Jacket, freeing the user to focus on science, engineering, and analytics.

No download Website Updated 05 Oct 2010 Charm++

Pop 24.52
Vit 1.00

Charm++ is a portable adaptive runtime system for parallel applications. Application developers create an object-based decomposition of the problem of interest, and the runtime system manages issues of communication, mapping, load balancing, fault tolerance, and more. Sequential code implementing the methods of these parallel objects is written in C++. Calls to libraries in C++, C, and Fortran are common and straightforward. Charm++ is portable across individual workstations, clusters, accelerators (Cell SPEs and GPUs), and supercomputers such as those sold by IBM (Blue Gene, POWER) and Cray (XT3/4/5/6). Applications based on Charm++ are used on at least 5 of the 20 most powerful computers in the world.

Download No website Updated 23 Feb 2012 Theano

Pop 50.28
Vit 2.84

Theano is a Python library that allows you to define, optimize, and evaluate mathematical expressions involving multi-dimensional arrays efficiently. Theano features tight integration with numpy, transparent use of a GPU, efficient symbolic differentiation, speed and stability optimizations, dynamic C code generation, and extensive unit-testing and self-verification. Theano has been powering large-scale, computationally intensive scientific investigations since 2007, yet it is also approachable enough to be used in the classroom (IFT6266 at the University of Montreal).
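The "define first, evaluate later" idea and symbolic differentiation can be sketched in a few lines of plain Python. This is a toy expression graph written only for illustration. The classes `Var`, `Const`, `Add`, and `Mul` are hypothetical and are not Theano's API, and the sketch does no optimization, code generation, or GPU work.

```python
from dataclasses import dataclass

@dataclass(frozen=True)
class Const:
    value: float
    def eval(self, env):
        return self.value
    def grad(self, wrt):
        return Const(0.0)

@dataclass(frozen=True)
class Var:
    name: str
    def eval(self, env):
        return env[self.name]
    def grad(self, wrt):
        # d(x)/d(x) = 1, d(y)/d(x) = 0
        return Const(1.0 if self.name == wrt else 0.0)

@dataclass(frozen=True)
class Add:
    left: object
    right: object
    def eval(self, env):
        return self.left.eval(env) + self.right.eval(env)
    def grad(self, wrt):
        # Sum rule
        return Add(self.left.grad(wrt), self.right.grad(wrt))

@dataclass(frozen=True)
class Mul:
    left: object
    right: object
    def eval(self, env):
        return self.left.eval(env) * self.right.eval(env)
    def grad(self, wrt):
        # Product rule
        return Add(Mul(self.left.grad(wrt), self.right),
                   Mul(self.left, self.right.grad(wrt)))

# Define the expression x*x + 3*x symbolically; nothing is computed yet.
x = Var("x")
expr = Add(Mul(x, x), Mul(Const(3.0), x))
dexpr = expr.grad("x")  # a new expression graph representing 2*x + 3

if __name__ == "__main__":
    print(expr.eval({"x": 4.0}))
    print(dexpr.eval({"x": 4.0}))
```

The derivative is itself an expression graph, so it can be evaluated for any input without being differentiated again.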

Download Website Updated 20 Jan 2011 libfairydust

Pop 19.62
Vit 29.19

Libfairydust is a small wrapper library intended for use with GPU clusters that 'hijacks' CUDA and OpenCL calls. It can be used to 're-route' calls to a certain GPU, so a process requesting GPU#0 might end up running on GPU#4 without knowing (or caring) about it. This works completely transparently and does not need any sort of 'cooperation' from the application, changes to code, or relinking.

Download Website Updated 18 May 2011 Moscrack

Pop 86.27
Vit 3.19

Moscrack is a WPA cracker for use on clusters. It supports MOSIX, SSH, and RSH connectivity and works by reading a word list from STDIN or a file, breaking it into chunks, and passing those chunks off to separate processes that run in parallel. The parallel processes are then executed on different nodes in your cluster. All results are checked and recorded on your master node. Logging and error handling are taken care of. It is capable of running reliably for long periods of time, without the risk of losing data or having to restart. Moscrack uses aircrack-ng by default. Pyrit for WPA cracking and Dehasher for Unix password hashes are supported via plugins.
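The chunk-and-dispatch pattern described above can be sketched as follows. This is a minimal illustration, not Moscrack's code: a local thread pool stands in for cluster nodes, and `chunked`, `check_chunk`, `run`, and the `is_match` callback are hypothetical names introduced here.

```python
from concurrent.futures import ThreadPoolExecutor
from itertools import islice

def chunked(words, size):
    """Yield successive lists of at most `size` words from any iterable."""
    it = iter(words)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk

def check_chunk(chunk, is_match):
    """Worker: test every word in one chunk, return the first match or None."""
    for word in chunk:
        if is_match(word):
            return word
    return None

def run(words, size, is_match, workers=4):
    """Split the word list into chunks, process them in parallel, collect hits."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        results = pool.map(lambda c: check_chunk(c, is_match), chunked(words, size))
        # The collecting side keeps only the chunks that reported a match.
        return [r for r in results if r is not None]

if __name__ == "__main__":
    wordlist = ["alpha", "bravo", "charlie", "delta", "echo"]
    print(run(wordlist, 2, lambda w: w == "delta"))
```

Because `chunked` accepts any iterable, the same sketch works when the word list is streamed line by line from a file or from standard input.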
