screen-scraper is a tool for extracting data from Web sites. It works much like a database that provides access to the information of the Web. It provides a graphical interface allowing you to designate URLs, data elements to be extracted, and scripting logic to traverse pages and work with scraped data. Once these items have been created, screen-scraper can be invoked from external languages such as .NET, Java, PHP, and Active Server Pages. It can be scheduled to scrape information at periodic intervals, and can automatically write extracted data to CSV files.
focuseek searchbox is a family of easily installable full-text search engines that can spider Internet and intranet data sources (Web sites, newsgroups, FTP sites, and others) or index data you feed to it and make it available for searching. It supports a variety of input formats (among them HTML, PDF, Microsoft Word DOC, and RTF), and is easily scriptable via SOAP and extendable through plugins. It can scale to millions of documents and comes with a full-fledged GUI client, a built in Web search portal, and an RSS server.
Lasso (Liberty Alliance Single Sign On) is an implementation of the Liberty Alliance specifications. These specifications define protocols for federated identities, single sign-on, etc. Lasso supports ID-FF 1.2, SAML 2.0, and parts of ID-WSF. It provides both a C library and bindings for several languages (Python, Java, Perl, PHP 4, and PHP 5).
mojoPortal is a cross-platform object oriented Web site framework. It supports PostgreSQL, MySQL, Firebird, SQLite and MS SQL for the backend. It includes a content management system, forums, blogs, photo galleries, newsletter, polls, surveys, an event calendar, an RSS feed aggregator, and a skinnable design.
The DataOne Software XML VIN Decoder service is platform independent software. A VIN is simply passed to the software and a response is provided in XML format. This service represents a way to gain instant access to tier one industry automotive data without the overhead of development and server infrastructure.
Crowd is a Web-based single sign-on (SSO) tool that simplifies application provisioning and identity management. It has connectors for popular open source projects like Apache (via Perl) and Subversion. Pre-built connectors for other popular commercial products like Jive Forums, JIRA, and Confluence are also available.