RSS 12 projects tagged "Text Processing"

Download Website Updated 27 Jan 2014 GNU awk

Screenshot
Pop 614.64
Vit 26.15

The awk utility interprets a special-purpose programming language that makes it possible to handle simple data-reformatting jobs with just a few lines of code.

Download Website Updated 29 Oct 2008 WebGlimpse

Screenshot
Pop 174.42
Vit 11.35

WebGlimpse is a scalable, feature-rich search engine for indexing your Web site or any collection of local and remote sites you choose. Features include customizable output formats, custom ranking/ordering of hits, fuzzy matching, boolean queries, a Web administration interface for multiple archives, logging of queries, caching of results, and more. Localized search interfaces are provided in multiple languages including Spanish, German, French, Italian, Norwegian, Finnish, Russian, Hebrew, and others. It supports 3rd party filters for indexing PDF, Word, and Excel files. It is free for academic and most nonprofit users.

Download Website Updated 30 Jul 2007 WordNet

Screenshot
Pop 73.12
Vit 2.58

WordNet® is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.

Download Website Updated 24 Sep 2001 securitylog2html

Screenshot
Pop 16.31
Vit 67.72

Securitylog2html is a script developed in AWK for filtering ipchains logs to generate HTML reports.

Download Website Updated 10 Dec 2001 uni-xml

Screenshot
Pop 24.62
Vit 67.15

Uni-XML is a project to replace standard Unix configuration files with XML. It also includes providing XML Schema files for the XML files, and XSLT stylesheets to convert the XML files to the configuration files.

Download No website Updated 11 Dec 2013 white_dune

Screenshot
Pop 530.06
Vit 136.79

white_dune is a graphical VRML97/X3DV editor, simple NURBS/Superformula 3D modeller, animation tool, and VRML97/X3DV commandline compiler in development. VRML97 (Virtual Reality Modeling Language) is the ISO standard for displaying 3D data over the Web via browser plugins ("HTML for realtime 3D"). X3DV is the direct successor of VRML97. VRML97 and X3DV have support for animation, real-time interaction, and multimedia (images, movies, and sounds). white_dune can read, create, and display VRML97/X3DV files and let the user change the scenegraph/fields. It also has support for stereoscopic view via "quadbuffer"-capable stereo visuals, and support for 3D input devices like a joystick, spaceball, or magnetic tracker.

Download Website Updated 31 Jul 2008 Ganglia

Screenshot
Pop 338.17
Vit 8.28

Ganglia is a scalable distributed monitoring system for high-performance computing systems such as clusters and grids. It is based on a hierarchical design targeted at federations of clusters. Ganglia is currently in use on over 500 clusters around the world and has scaled to handle clusters with 2000 nodes.

Download No website Updated 30 Oct 2002 fti2svg.pl

Screenshot
Pop 30.00
Vit 1.00

fti2svg.pl converts SGI Irix .fti vector-based icons to .svg images suitable for Nautilus or any other SVG-capable program.

Download Website Updated 14 Jan 2010 Doodle

Screenshot
Pop 165.99
Vit 6.28

Doodle is a desktop search engine for Linux. It searches your hard drive for files using pattern matching on meta-data. It extracts file-format specific meta-data using libextractor and builds a suffix tree to index the files. The index can then be searched rapidly. It is similar to locate, but can take advantage of information such as ID3 tags. It is possible to do full-text indexing using the appropriate libextractor plugins. It also supports using FAM to keep the database up-to-date.

Download Website Updated 03 Oct 2005 cz2cz tools

Screenshot
Pop 17.03
Vit 1.85

cz2cz tools is software for converting texts between various charset encodings that are used in the Czech language. The most important feature is autodetection of the most-used encodings (UTF-8, ISO-8859-2, Win-1250, cp850, and Kamenickych). There are both console non-interactive (text-based) and interactive (curses) applications. cz2cz also allows you to convert characters with diacritics to TeX (LaTeX) conventions. The interactive part of cz2cz tools can be used for quick manual complementing of diacritics to texts.

Screenshot

Project Spotlight

JSONMinify

A JSON+C minifier.

Screenshot

Project Spotlight

(R)?ex

A tool to ease the execution of commands on multiple remote servers.