RSS 25 projects tagged "Windows"

Download Website Updated 02 Aug 2006 HTMLDOC

Screenshot
Pop 725.33
Vit 7.72

HTMLDOC converts HTML files and Web pages into indexed HTML, PostScript, and PDF files suitable for online viewing and printing. It can be used as a standalone GUI application, in a batch document processing environment, as a Web-based report generation application, or in embedded environments to support printing of HTML content. It runs on all Unix platforms as well as Mac OS X and Windows 2000 and higher.

Download Website Updated 06 Oct 2012 GNU libextractor

Screenshot
Pop 355.21
Vit 34.72

libextractor is a library used to extract meta-data from files of arbitrary type. It is designed to use helper-libraries to perform the actual extraction, and to be trivially extendable by linking against external extractors for additional file types. The goal is to provide developers of file-sharing networks, file managers, and WWW-indexing bots with a universal library to obtain meta-data about files. It includes a shell-command and bindings for Java (JNI) and Python.

Download Website Updated 15 Mar 2006 SWISH++

Screenshot
Pop 281.98
Vit 9.51

SWISH++ is a Unix-based file indexing and searching engine (typically used to index and search files on web sites). It was based on SWISH-E although SWISH++ is a complete rewrite. SWISH++ is at least 10 times faster and can handle much larger numbers of files. Additionally, it has unique features such as selective non-indexing, on-the-fly filters, user-selectable stemming, and more.

Download Website Updated 19 Mar 2009 Multivalent PDF Tools

Screenshot
Pop 212.28
Vit 2.47

The Multivalent PDF Tools is a suite of tools for manipulating PDF documents. It includes tools for compressing, uncompressing (for hand editing), obtaining metadata, splitting and merging, encrypting and decrypting, validating, imposition (aka n-up), making page images, extracting text, and full-text indexing (with Lucene). The compress tool shrinks the PDF 1.5 Reference from 13.5MB to 8MB in PDF 1.5/Acrobat 6 format and down to 5.1MB in a new proposed "Compact" format.

Download Website Updated 17 Mar 2011 Splunk

Screenshot
Pop 199.41
Vit 8.97

Splunk is an engine for machine data. Use Splunk to collect, index, and harness the fast moving machine data generated by all your applications, servers, and devices: physical, virtual, and in the cloud. Search and analyze all your real-time and historical data from one place. Splunking your machine data lets you troubleshoot problems and investigate security incidents in minutes, not hours or days. Monitor your end-to-end infrastructure to avoid service degradation or outages. Meet compliance mandates at lower cost. Correlate and analyze complex events spanning multiple systems. Gain new levels of operational visibility and intelligence for IT and the business.

Download Website Updated 15 Dec 2004 Namazu

Screenshot
Pop 196.12
Vit 3.76

Namazu is a full-text search system intended for easy use. Not only does it work as a small or medium scale Web search engine, but also as a personal search system for email or other files. Supported document types: HTML, Mail/News, MHonArc, RFC, TeX (with detex), man (with groff), Word (with wvWare), PDF (with pdftotext) and plain text.

Download Website Updated 04 Jul 2011 Emdros

Screenshot
Pop 185.49
Vit 19.68

Emdros is a corpus query system for storing and searching linguistically annotated text. It is very generic, supporting almost any kind of annotation from almost any linguistic theory. All linguistic levels of analysis are supported, including phonology, morphology, the lexical level, syntax, and discourse. The core libraries act as a middleware layer between a client and an underlying SQL database. MySQL, PostgreSQL, and SQLite are supported.

Download Website Updated 05 Mar 2007 Hyper Estraier

Screenshot
Pop 136.84
Vit 8.66

Hyper Estraier is a full-text search system. It can be used as a Web search engine, mailbox searching, etc. It features high performance searching, high scalability of target documents, a perfect recall ratio by the N-gram method, phrase searching, attribute searching, and similarity searching. Multilingualism is supported with Unicode. It is independent of file format and repository, and has a simple and powerful API.

Download Website Updated 30 Oct 2011 PyLucene

Screenshot
Pop 125.06
Vit 11.67

PyLucene is a Python extension for accessing Java Lucene from Python. Its goal is to allow use of Lucene's text indexing and searching capabilities from Python. It is designed to be API compatible with the latest version of Java Lucene.

No download Website Updated 28 Nov 2005 X-Hive/DB

Screenshot
Pop 91.64
Vit 3.78

X-Hive/DB is a powerful native XML database designed for software developers who require advanced XML data processing and storage functionality within their applications. The comprehensive X-Hive/DB Java API contains methods for storing, querying, retrieving, transforming, and publishing XML data. X-Hive/DB supports all major W3C standards, such as XQuery, XPath, DOM, XPointer, XML Schemas, and more.

Screenshot

Project Spotlight

AutoArchive

A simple backup utility.

Screenshot

Project Spotlight

Imperialism Remake

A remake of the classic strategy game Imperialism.