Yioop! is a PHP search engine. Yioop! can be configured as either a general purpose search engine for the whole Web or it can be configured to provide search results for a set of URLs or domains. Yioop can crawl pages or can directly index archives such as ARC and WARC. It supports indexing several file formats such as HTML, Atom, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and sitemaps. The Yioop! crawler can be deployed on one or many machines. It supports having one or more to crawl scheduler processes, as well as multiple fetchers and mirrors. Crawling respects robots.txt including Crawl-delay. Yioop! crawls are stored in a Web archive format that is easy to move around. Crawling can be done on one machine and the results deployed elsewhere. Yioop! supports mixing of crawls. Yioop! comes with a search front end that can be localized as desired using a GUI. This GUI supports RTL languages. Management of crawls can also be done using this GUI. Yioop! can be configured in a straightforward manner to make use of file caching or memcache if available.
DBeaver is a universal database manager and SQL Client. It supports MySQL, PostgreSQL, Oracle, DB2, MSSQL, Sybase, Mimer, HSQLDB, SQLite, Derby, and any database that has a JDBC driver. It is a GUI program that allows you to view the structure of a database, execute SQL queries and scripts, browse and export table data, handle BLOB/CLOB values, modify database meta objects, etc. It has a native UI (provided by the Eclipse SWT library), great performance, and relatively low memory consumption.
SOFA is a statistics, analysis, and reporting program with an emphasis on ease of use, learning as you go, and beautiful output. SOFA can connect directly to your database and lets you display results in an attractive format ready to share or put in a spreadsheet. SOFA will help you learn as you go, whether you are a student, business analyst, or researcher.
TinyIB is a lightweight PHP image board which emulates the functionality of 4chan. If you use MySQL or SQLite, you can use it to create an efficient setup able to handle large amounts of traffic. If you don't use a database, it can store posts as text files for a portable setup capable of running on virtually any PHP host.
I, Librarian is a PDF manager or PDF organizer that allows individual researchers or a group of researchers to create an annotated collection of PDF articles. Users may build the virtual library collaboratively, thus sharing the workload of literature mining. It enables smart browsing and fast searching in reference data and PDF files, and includes an advanced tool for mining scientific literature from PubMed, PubMed Central, NASA ADS, arXiv, IEEE Xplore, and HighWire Press.
SQLet allows you to directly execute SQL on multiple text files, right from the Linux commandline. In one single command, you can read in text files (with or without header lines) and perform arbitrary select statements, including joins over several files. SQLet can thus replace awk or grep in some instances.
Klen-library is a program for creating electronic books. It consists of three modules, for editing books, reading books, and creating tests. It offers password protection of every book and the editing mode. The print function makes it self-sufficient, as it allows creation of books without using third-party text editors. It uses SQLite, so you don't have to install a database server. It can export to the XWB, HTML, TXT, PDF, PostScript, and RTF formats, and can import from XWB, HTML, and TXT.
Fuzzy machine learning framework is a library and a GUI front-end for machine learning using intuitionistic fuzzy data. The approach is based on the intuitionistic fuzzy sets and the possibility theory. Further characteristics are fuzzy features and classes; numeric, enumeration features and features based on linguistic variables; user-defined features; derived and evaluated features; classifiers as features for building hierarchical systems; automatic refinement in case of dependent features; incremental learning; fuzzy control language support; object-oriented software design with extensible objects and automatic garbage collection; generic data base support through ODBC; text I/O and HTML output; an advanced graphical user interface based on GTK+; and examples of use.