Yioop! is a PHP search engine. Yioop! can be configured as either a general purpose search engine for the whole Web or it can be configured to provide search results for a set of URLs or domains. Yioop can crawl pages or can directly index archives such as ARC and WARC. It supports indexing several file formats such as HTML, Atom, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and sitemaps. The Yioop! crawler can be deployed on one or many machines. It supports having one or more to crawl scheduler processes, as well as multiple fetchers and mirrors. Crawling respects robots.txt including Crawl-delay. Yioop! crawls are stored in a Web archive format that is easy to move around. Crawling can be done on one machine and the results deployed elsewhere. Yioop! supports mixing of crawls. Yioop! comes with a search front end that can be localized as desired using a GUI. This GUI supports RTL languages. Management of crawls can also be done using this GUI. Yioop! can be configured in a straightforward manner to make use of file caching or memcache if available.
I, Librarian is a PDF manager or PDF organizer that allows individual researchers or a group of researchers to create an annotated collection of PDF articles. Users may build the virtual library collaboratively, thus sharing the workload of literature mining. It enables smart browsing and fast searching in reference data and PDF files, and includes an advanced tool for mining scientific literature from PubMed, PubMed Central, NASA ADS, arXiv, IEEE Xplore, and HighWire Press.
blogstrap.py is a simple, no frills blog content management system powered by Twitter's Bootstrap and web.py. It features most things you would come to expect from a simple blogging platform. You can browse posts by category or subcategory, see recent posts, and mark favorites. You can perform basic searches. It includes an About page. A basic tag system is implemented (popular tags are counted and shown). A simple comment system is available. A robust administrative interface is included where you can create and edit posts. You can upload images and include them on a Credits page, where you can properly attribute the original author. Comments can be set to on, off, or manual approval (moderated). Security has been a top priority since the beginning. Blogstrap.py has low resource usage and runs quickly on top of Lighttpd.
SQLet allows you to directly execute SQL on multiple text files, right from the Linux commandline. In one single command, you can read in text files (with or without header lines) and perform arbitrary select statements, including joins over several files. SQLet can thus replace awk or grep in some instances.
selfoss is a multipurpose RSS reader, live stream, mashup, and aggregation Web application. You can register RSS feeds, and this Web-based PHP application will continuously fetch new RSS feed items. The items will be shown in a stream. You can also add other sources, like deviantart, Twitter, or tumblr users. Attaching new sources is very easy, and you can add any source you want (e.g. IMAP email account, log files, etc.). selfoss also allows you to collect all your postings on different communities (e.g. Twitter, your blog, etc.) and show it in one place. It features a Web-based RSS reader, universal aggregator, mobile support (Android, iOS, and iPad), and support for MySQL, SQLite, and MongoDB databases. It is easy extensible with an open plugin system (write your own data connectors). It is a lightweight PHP application taking up less than 2 MB.
e-Photogallery is a Web-based photo gallery. It supports very large galleries and large files. Archive files (zip, rar, gzip, bzip2, tar) can be used for uploading photos. Most bitmap formats are supported, as well as SVG, PS, PDF, and EPS. Whole photo folders can be downloaded in various output formats and sizes. Other features include searching, tagging of photos and folders, and online resizing of thumbnails, and extensive control of access rights. EXIF data is read and used.
TinyIB is a lightweight PHP image board which emulates the functionality of 4chan. If you use MySQL or SQLite, you can use it to create an efficient setup able to handle large amounts of traffic. If you don't use a database, it can store posts as text files for a portable setup capable of running on virtually any PHP host.
Famflows is a program which tracks "flows" of energy and materials inside a household. It has an alternative approach from double entry accounting systems. In those systems (like gnucash or jgnash), the transactions are balanced in numeric value. In famflows, instead, the transactions are balanced in the conversion, but not in the numeric value.