139 projects tagged "Text Processing"
4Suite is a Python-based toolkit for XML and RDF application development. It features a library of integrated tools for XML processing, implementing open technologies such as DOM, RDF, XSLT, XInclude, XPointer, XLink, XPath, XUpdate, RELAX NG, and XML/SGML Catalogs. Layered upon this is an XML and RDF data repository and server, which supports multiple methods of data access, query, indexing, transformation, rich linking, and rule processing, and provides the data infrastructure of a full database system, including transactions, concurrency, access control, and management tools. It also supports HTTP, RPC, SOAP, and FTP, plus APIs in Python and XSLT.
Ciao is a complete Prolog system subsuming ISO-Prolog with a novel modular design which allows both restricting and extending the language. Ciao extensions currently include feature terms (records), higher-order, functions, constraints, objects, persistent predicates, a good base for distributed execution (agents), and concurrency. Libraries also support WWW programming, sockets, and external interfaces (C, Java, Tcl/Tk, relational databases, etc.). An Emacs-based environment, a stand-alone compiler, and a toplevel shell are also provided.
DSML is the Directory Services Markup Language, an XML dialect for working with directory information. The DSML Tools provide for the querying of any LDAP directory (with search results output as DSML), the importing of DSML data into any LDAP directory, the directory-context validation of DSML (checking for illegal attributes in the entries, etc.), and the calculation of the differences (for a directory) between two DSML documents to provide an XML Diff algorithm for DSML data. This software makes all LDAP-supporting directories DSML-enabled. It can also check the integrity of DSML data, and show at a glance how two data sets, represented as DSML, differ.
JReferences is a program written in Java for managing bibliographic references in the BibTeXML format. References are stored in a binary file database or, optionally, in a MySQL database. A PHP Web frontend is available. It can import references in BibTeX, RIS, BibTeXML, and DocBook formats.
JSX serializes Java objects to XML. You can persist objects, evolve them, and send them over the network and between applications. Your object data becomes human-readable and human-writable. You can test it, search it, profile it, audit it, and edit it with ordinary text and XML tools. JSX handles all POJOs and also all classes that require Java's own object serialization.
Net::Z3950::SimpleServer is a Perl module which implements the server side of the Z39.50 (information retrieval) protocol. It hides the complexity of network exchanges, packet serialization, and session handling. You are required only to implement simple callbacks to support searching and record retrieval. It is the basis of the "Zoogle" project, which is a Z39.50 gateway to the Google web index.
RADpage is a rapid application development system for dynamic, data-driven Web applications. It operates directly inside the browser, without requiring a client-dependent development environment. RADpage comes as an Apache module or a CGI, and includes a powerful object-oriented programming language with HTML/XML-like syntax called heitml (pronounced "Hi-TML") and more than 150 Web application components. Development is done on a fully functional application, avoiding the edit-save-browse cycle associated with conventional client-side development environments. The RADpage editor stores application pages as readable, well-formed XML or, upon request, as HTML for editing with HTML editors.
Sixpack is a graphical and command-line bibliography database manager written in Perl/Tk. It works with the supplied package 'bp', which can import from and export to a number of formats, including BibTeX, EndNote, MEDLINE, ProCite, and many others. It can download references directly from the Web and open articles using external viewers. It can also interface with Emacs/XEmacs and LyX.
WordNet® is an on-line lexical reference system whose design is inspired by current psycholinguistic theories of human lexical memory. English nouns, verbs, adjectives and adverbs are organized into synonym sets, each representing one underlying lexical concept. Different relations link the synonym sets.
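The synonym-set organization described for WordNet can be illustrated with a small sketch. This is not WordNet's own database format or API: the class `SynsetStore`, its method names, the synset identifiers, and the tiny sample vocabulary are all assumptions made for illustration. It only shows the general idea of words grouped into synonym sets, with named relations linking one set to another.

```python
from collections import defaultdict


class SynsetStore:
    """Toy model of synonym sets linked by relations (hypothetical, not WordNet's API)."""

    def __init__(self):
        self.synsets = {}                    # synset id -> frozenset of words
        self.word_index = defaultdict(set)   # word -> ids of synsets containing it
        self.relations = defaultdict(set)    # (source id, relation name) -> target ids

    def add_synset(self, synset_id, words):
        """Register a synonym set representing one lexical concept."""
        self.synsets[synset_id] = frozenset(words)
        for word in words:
            self.word_index[word].add(synset_id)

    def relate(self, source_id, relation, target_id):
        """Link two synonym sets with a named relation."""
        self.relations[(source_id, relation)].add(target_id)

    def synonyms(self, word):
        """Return the other words that share at least one synonym set with `word`."""
        found = set()
        for synset_id in self.word_index.get(word, ()):
            found |= self.synsets[synset_id]
        found.discard(word)
        return sorted(found)

    def related(self, word, relation):
        """Return words in synonym sets reachable from `word` via `relation`."""
        found = set()
        for synset_id in self.word_index.get(word, ()):
            for target_id in self.relations.get((synset_id, relation), ()):
                found |= self.synsets[target_id]
        return sorted(found)


def build_demo_store():
    """Build a two-synset example; the identifiers are illustrative only."""
    store = SynsetStore()
    store.add_synset("car", ["car", "auto", "automobile", "machine", "motorcar"])
    store.add_synset("motor_vehicle", ["motor vehicle", "automotive vehicle"])
    store.relate("car", "hypernym", "motor_vehicle")
    return store


if __name__ == "__main__":
    demo = build_demo_store()
    print(demo.synonyms("car"))
    print(demo.related("automobile", "hypernym"))
```

In this sketch a word may belong to several synonym sets, so lookups go through a word index first and then follow relations between sets rather than between individual words.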