The Advanced Linux Sound Architecture is composed of several parts. The first is a fully modularized sound driver which supports module autoloading, devfs, isapnp autoconfiguration, and gives complete access to analog audio, digital audio, control, mixer, synthesizer, DSP, MIDI, and timer components of audio hardware. It also includes a fully-featured kernel-level sequencer, a full compatibility layer for OSS/Free applications, an object-oriented C library which covers and enhances the ALSA kernel driver functionality for applications (client/server, plugins, PCM sharing/multiplexing, PCM metering, etc.), an interactive configuration program for the driver, and some simple utilities for basic management.
libsndfile is a C library for reading and writing sound files such as AIFF, AU, WAV, and others through one standard interface. It can currently read/write 8, 16, 24 and 32-bit PCM files as well as 32 and 64-bit floating point WAV files and a number of compressed formats. It compiles and runs on *nix, MacOS, and Win32.
PHP Voice (formerly known as PHP VXML) contains four classes that assist in developing voice applications using PHP. It supports Speech Synthesis Markup Language 1.0, Speech Recognition Grammar Specification 1.0, Voice Browser Call Control: CCXML 1.0, and Voice Extensible Markup Language (VoiceXML) 2.0.
getID3() is a PHP script that extracts useful information from MP3s and other multimedia file formats. It extracts information like the play time, bitrate, and resolution from the following file formats: MP3, Ogg Vorbis, WAV, AVI, Matroska (MKV), MPEG-1 video, Windows Media (ASF, WMA, WMV), RealAudio & RealVideo, Monkey's Audio, FLAC & OggFLAC, VQF, Speex, Musepack (MPC), AAC, AU, VOC, AIFF, OptimFROG, WavPack, LiteWave, LPAC, RKAU, SZIP, Shorten, Quicktime, NSV, MIDI, ZIP, ISO9660, TTA, PhotoCD, BMP, PNG, GIF, and JPEG. It parses information from ID3v1, ID3v2, Ogg Vorbis, Lyrics3 v1 & v2, RIFF (AVI, WAV, CDA, CART, BWF), APE, ReplayGain, PNG, and GIF tags, and can write ID3v1, ID3v2, APE2, FLAC, and Ogg Vorbis tags.
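As a rough illustration of the simplest kind of tag parsing mentioned above, the following sketch reads an ID3v1 tag. It is not getID3()'s code or API (getID3() is written in PHP); it is a minimal Python sketch that assumes the standard ID3v1 layout: a 128-byte block at the end of the file starting with "TAG", followed by fixed-width title, artist, album, year, and comment fields and a one-byte genre index. The function name `parse_id3v1` and the sample bytes are hypothetical.

```python
import struct


def parse_id3v1(data):
    """Parse an ID3v1 tag from the last 128 bytes of `data`.

    Returns a dict of fields, or None if no ID3v1 tag is present.
    """
    if len(data) < 128:
        return None
    tag = data[-128:]
    if tag[:3] != b"TAG":
        return None

    # Fixed-width fields: 30 + 30 + 30 + 4 + 30 + 1 = 125 bytes after "TAG".
    title, artist, album, year, comment, genre = struct.unpack(
        "<30s30s30s4s30sB", tag[3:]
    )

    def clean(raw):
        # Fields are padded with NUL bytes (or spaces) up to their width.
        return raw.split(b"\x00", 1)[0].decode("latin-1").strip()

    return {
        "title": clean(title),
        "artist": clean(artist),
        "album": clean(album),
        "year": clean(year),
        "comment": clean(comment),
        "genre": genre,  # numeric index into the ID3v1 genre list
    }


# Hypothetical sample: some leading audio bytes followed by a hand-built tag.
sample = (
    b"\xff\xfb" + b"\x00" * 100
    + b"TAG"
    + b"Song".ljust(30, b"\x00")
    + b"Artist".ljust(30, b"\x00")
    + b"Album".ljust(30, b"\x00")
    + b"2004"
    + b"demo".ljust(30, b"\x00")
    + bytes([17])
)
info = parse_id3v1(sample)
```

For a real file, the same function could be given the bytes read from the end of the file (for example, after seeking to 128 bytes before the end). Richer formats such as ID3v2 or APE are considerably more involved and are not covered by this sketch.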
pHash is an implementation of various perceptual hashing algorithms. A perceptual hash is a fingerprint of a multimedia file derived from various features of its content. Unlike cryptographic hash functions, which rely on the avalanche effect (small changes in the input lead to drastic changes in the output), perceptual hashes are "close" to one another if the features are similar. Potential applications include copyright protection, similarity searches for media files, and digital forensics.
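The contrast with cryptographic hashes can be sketched in a few lines. The example below does not use pHash or any of its algorithms; it assumes a much simpler "average hash" (one bit per pixel, set when the pixel is brighter than the image mean) applied to a tiny hypothetical grayscale grid, and compares Hamming distances against SHA-256 on the same data. All function names and pixel values are illustrative.

```python
import hashlib


def average_hash(pixels):
    """Return a bit string: '1' where a pixel is brighter than the mean."""
    flat = [p for row in pixels for p in row]
    mean = sum(flat) / len(flat)
    return "".join("1" if p > mean else "0" for p in flat)


def sha256_bits(pixels):
    """Return the SHA-256 digest of the raw pixel bytes as a bit string."""
    data = bytes(p for row in pixels for p in row)
    digest = hashlib.sha256(data).digest()
    return "".join(f"{byte:08b}" for byte in digest)


def hamming(a, b):
    """Count the positions at which two equal-length bit strings differ."""
    return sum(x != y for x, y in zip(a, b))


# Hypothetical 4x4 grayscale image: dark on the left, bright on the right.
original = [
    [10, 20, 200, 210],
    [15, 25, 205, 215],
    [12, 22, 202, 212],
    [18, 28, 208, 218],
]
# A slightly brightened copy, standing in for a re-encoded or adjusted image.
brightened = [[min(255, p + 3) for p in row] for row in original]

perceptual_distance = hamming(average_hash(original), average_hash(brightened))
crypto_distance = hamming(sha256_bits(original), sha256_bits(brightened))

print("perceptual hash distance:", perceptual_distance)
print("SHA-256 bit distance:", crypto_distance)
```

The brightened copy keeps the same bright/dark pattern, so the average hashes stay close, while the SHA-256 digests differ in many bits. A similarity search would treat a small Hamming distance as a likely match.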