RSS 39 projects tagged "Text Processing"

Download Website Updated 04 Mar 2014 Sanzang

Screenshot
Pop 312.55
Vit 9.83

Sanzang is a compact and simple cross-platform machine translation system. It is especially useful for translating from the CJK languages (Chinese, Japanese, and Korean), and it is very suitable for working with ancient and otherwise difficult texts. Unlike most other machine translation systems, Sanzang is small and approachable. Any user can develop his or her own translation rules, and these rules are simply stored in a text file and applied at runtime.

Download Website Updated 28 Jun 2012 Xapian and Omega

Screenshot
Pop 402.51
Vit 16.24

Xapian is a search engine library, scalable to collections containing hundreds of millions of documents. It's written in C++ with bindings for Perl, Python, PHP, Java, Tcl, C#, Ruby, and Lua. It is a highly adaptable toolkit that allows developers to easily add advanced indexing and search facilities to their own applications. It supports the Probabilistic Information Retrieval model and also a rich set of boolean query operators. Omega is a Web search application built upon the Xapian library. It can index a Web server's document tree (including HTML, PDF, OpenOffice, MS Word/Excel/Powerpoint/Works, WordPerfect, RTF, PS, etc.), or data exported from arbitrary sources (e.g. SQL databases).

No download No website Updated 01 Jan 2011 Winnow

Screenshot
Pop 18.06
Vit 34.82

Winnow efficiently trains and operates any number of unique Bayesian (Naive Bayes) classifiers on large sets of content. It has very high performance and works with very small training and unbalanced training sets. It has been used to power an innovative Web feed reader that uses smart tags, which learn and find the content you want to see, from more sources than you can follow with traditional feed readers. It works particularly well with Ruby and Ruby on Rails.

Download Website Updated 23 Nov 2009 Syck

Screenshot
Pop 82.27
Vit 4.74

Syck is a YAML parser library that is designed to load data into scripting languages. Extensions for Ruby, PHP, and Python are included.

Download Website Updated 24 Feb 2009 Rainpress

Screenshot
Pop 29.15
Vit 1.56

Rainpress is a CSS compressor. It can be used either as a library in Ruby programs or as a standalone executable. Rainpress doesn't apply common compression algorithms like gzip; rather, it removes unecessary characters and replaces long attributes with shorter equivalents.

Download Website Updated 08 Feb 2009 deplate

Screenshot
Pop 60.27
Vit 5.87

deplate converts wiki-like markup to LaTeX (standard classes, koma, dramatist, sweave), HTML/PHP (single page, chunked/website, HTML, or s5-based slideshow), DocBook (article, book, man/ref page), and really plain text. Currently supported input formats are viki and Ruby's rdoc. The viki markup supports footnotes, citations, index, table of contents, embedded LaTeX for mathematics, integration with R for dynamically generated figures and tables, and more. Output can be customized via page templates.

Download Website Updated 19 Jan 2009 SiSU

Screenshot
Pop 195.06
Vit 18.74

SiSU (Structured information, Serialized Units) is a lightweight markup based, text structuring and publishing framework (that features granular search). With minimal markup of a plaintext file, it produces: plain-text, HTML, XHTML, XML, ODF, LaTeX, PDF, and populates an SQL database at an object/paragraph level for granular searches. Prepare documents using your text editor of choice, then use SiSU to generate the desired output formats. SiSU is controlled from the command line.

No download Website Updated 13 Jan 2009 Ramaze

Screenshot
Pop 10.05
Vit 1.54

Ramaze is a simple, light, and modular Web application framework for use with Ruby. Ramaze aims to adhere to the KISS and POLS principles. Ramaze has minimal dependencies, and is very modular, allowing you to use your own choice of ORM (DB interface and modelling library), JavaScript library, and templating library. Ramaze is thoroughly documented, with plenty of examples and helpers, and is developed with Behavior Driven Design (BDD), with a complete set of code specifications.

Download Website Updated 26 Oct 2008 rwdgutenberg text reader

Screenshot
Pop 33.27
Vit 3.86

rwdgutenberg is a book reading tool. It can find text files, build lists of text files, auto read texts, has context sensitive help, and can submit bug reports with one click. Additional applets can be downloaded. It includes the rwdtinker framework, and should not require any other downloads. It only requires that Ruby be installed, and should work on all platforms.

Download Website Updated 17 Oct 2008 rwdhypernote

Screenshot
Pop 23.24
Vit 4.08

rwdhypernote is a hierarchical note editor. It uses a directory structure for notes, and can record internal links and Web links. It has context-sensitive help. Additional applets can be downloaded. The GUI interface used is RubyWebDialogs, which runs through a Web browser. Therefore, it is completely cross-platform. This is part of the Tinker framework using Ruby, so applets can be added and removed.

Screenshot

Project Spotlight

Razer device configuration tool

A Razer device configuration tool.

Screenshot

Project Spotlight

ZABBIX

An enterprise-class distributed monitoring solution.