DromeAudio is a small audio manipulation and playback library. It features a simple API for loading, generating, processing, and playing audio. Some of its features include loading and saving WAV sounds, loading Ogg Vorbis sounds, audio mixing/playback, and dynamic audio processing effects such as pitch shifting and echo.
pHash is an implementation of various perceptual hashing algorithms. A perceptual hash is a fingerprint of a multimedia file derived from various features from its content. Unlike cryptographic hash functions that rely on the avalanche effect of small changes in input leading to drastic changes in the output, perceptual hashes are "close" to one another if the features are similar. Potential applications include copyright protection, similarity searches for media files, or even digital forensics.
irrKlang is a high level 2D and 3D cross platform sound engine and audio library that plays WAV, MP3, OGG, MOD, XM, IT, S3M, and other file formats. It can be used by C++ and all .NET languages (C#, Visual Basic.NET, etc.). It has all of the features common to low-level audio libraries as well as lots of useful features like a sophisticated streaming engine, extendable audio reading, single and multi-threading modes, 3D audio emulation for low-end hardware, a plugin system, multiple rolloff models, sound effects, and more. All of these can be accessed via an extremely simple API using an object oriented design.
Mediastreamer is a portable C library that allows you to create and run audio and video streams. It is designed for any kind of voice over IP applications. It features RTP connectivity, audio codecs (Speex, iLBC, G711, GSM, and AMR), video codecs (MPEG4, H263, H264, and Theora), sound card I/O, wav file streaming, webcam video capture, echo-cancellation, conferencing, parametric equalization, and various other utilities. It has a modular design that makes it extensible through plugins. This is the media-streaming component of linphone, a GPL SIP video phone.
CLAM is a multiplatform software framework for research and application development for the Audio and Music domains. It offers a conceptual model for building systems as well as tools for the analysis, synthesis, and transformation of audio signals using high level representation (notes, spectral peaks, etc.). It also provides multiplatform system abstraction and tools for audio and MIDI file handling, device handling, GUI and XML support, and more. It has proper Linux audio architecture integration (Ladspa, Alsa, OSC, Jack, VST, etc.) and rapid graphical prototyping both for the processing core and the user interface (Qt designer integration).
PyKaraoke is a karaoke player which supports the CDG (MP3+G, OGG+G), MIDI (.KAR, .MID), and MPEG formats. The project consists of a GUI frontend and separate pycdg, pykar, and pympg player modules. The GUI provides a full application frontend, with playlist functionality and a searchable database. The pycdg, pykar, and pympg modules can also be used as standalone command-line players or imported into any Python programs that would like CDG, MIDI/KAR, or MPEG playback.