5 projects tagged "Text Processing"
WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking and ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, query logging, result caching, and more. Localized search interfaces are provided in multiple languages, including Spanish, German, French, Italian, Norwegian, Finnish, Russian, Hebrew, and others. It supports third-party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.
Perform a variety of functions on PDF files. These include (but are not limited to) adding page numbers and stamps; merging and overlaying; extracting pages; extracting and filling in field data; extracting, replacing, and resizing images; highlighting text; extracting and editing bookmarks; emailing the output via SMTP; adding and extracting attachments; resizing pages; and encryption and password protection. This is a compiled command-line program that runs standalone.