RSS 21 projects tagged "search engine"

Download Website Updated 06 Apr 2014 OpenSearchServer

Screenshot
Pop 569.58
Vit 89.63

OpenSearchServer is a powerful, enterprise-class, search engine program. Using its Web user interface, crawlers (Web, file, database, etc.), and REST/RESTFul API, you can integrate advanced full-text search capabilities into your application.

No download Website Updated 02 Dec 2013 Yioop!

Screenshot
Pop 161.95
Vit 12.21

Yioop! is a PHP search engine. Yioop! can be configured as either a general purpose search engine for the whole Web or it can be configured to provide search results for a set of URLs or domains. Yioop can crawl pages or can directly index archives such as ARC and WARC. It supports indexing several file formats such as HTML, Atom, PDF, DOC, PPT, RTF, RSS, XML, SVG, PNG, JPG, BMP, GIF, and sitemaps. The Yioop! crawler can be deployed on one or many machines. It supports having one or more to crawl scheduler processes, as well as multiple fetchers and mirrors. Crawling respects robots.txt including Crawl-delay. Yioop! crawls are stored in a Web archive format that is easy to move around. Crawling can be done on one machine and the results deployed elsewhere. Yioop! supports mixing of crawls. Yioop! comes with a search front end that can be localized as desired using a GUI. This GUI supports RTL languages. Management of crawls can also be done using this GUI. Yioop! can be configured in a straightforward manner to make use of file caching or memcache if available.

No download Website Updated 18 Oct 2013 Trokam

Screenshot
Pop 22.09
Vit 13.47

Trokam is a project to develop a search engine based on collaborative Web crawling and easily-replicable servers, to enable organizations and companies to collaborate with dedicated servers connected to a load balancer on collaborative development.

No download Website Updated 18 Oct 2013 Libcolumbus

Screenshot
Pop 40.12
Vit 2.14

Libcolumbus is a small error tolerant search engine designed to deal with noisy data and typos. It will power the searches in the next generation of Ubuntu's HUD system as well as other searches. It has a fast implementation of the Levenshtein distance algorithm, which allows it to correct errors such as added and dropped letters (e.g. 'bar' -> 'bard'), changed letters ('ctr' -> 'car') and translations ('acr' -> 'car'). It also allows the user to customize the error values. Libcolumbus is designed to be small, efficient and easy to embed. It is programmed in C++ but also provides C and Python APIs.

No download Website Updated 04 Sep 2013 Web Search Script

Screenshot
Pop 23.28
Vit 15.02

Web Search Script can search the contents of a text file stored on your server by editing the script source code, or the file location can be left blank to allow your users to enter a URL to a text document stored on a remote Web server. Search results will be listed by line number, with contextual information surrounding the search term. Clicking search results will scroll the complete text in the right pane to the result's location in the complete text.

Download Website Updated 29 Jan 2013 Language Popularity Index

Screenshot
Pop 25.38
Vit 1.01

The Language Popularity Index tool is a fully automatic, transparent tool to measure the popularity of programming languages on the Internet.

Download Website Updated 05 Jul 2012 Arch Search Engine

Screenshot
Pop 17.44
Vit 35.88

Arch is an extension of Apache Nutch (a popular, highly scalable general purpose search engine) for intranet search. It includes blind test evaluation tools for comparing to other search engines. Arch has many features critical for corporate environments, such as document level security.

No download No website Updated 10 May 2012 RestPose

Screenshot
Pop 20.83
Vit 2.05

RestPose is a search engine. It is designed to take a set of documents and then, when given a query, to return ranked lists of documents which are a good match for that query. RestPose manages a set of internal indexes and provides an interface (over HTTP, in a fairly RESTful style, using JSON as the main transfer format) which allows documents to be submitted and removed from indexes, and which allows searches to be performed.

No download No website Updated 02 Apr 2012 mycelium

Screenshot
Pop 15.65
Vit 27.29

mycelium is an information retrieval system. It aspires to be an alternative to Nutch / Lucene. It uses MongoDB as a storage engine.

Download Website Updated 21 Nov 2011 Seeks

Screenshot
Pop 72.59
Vit 4.89

Seeks is a Web search proxy, meta-search engine, and real-time P2P pattern matching network for social Web search. Its specific purpose is to regroup users whose queries are similar so they can share both the query results and their experience with these results. On this basis, Seeks allows true real-time, decentralized Web search to emerge. In the long term, there is no need for Web crawlers and third-party Web indexes. Most importantly, Seeks is intended to become a flagship for a fair, transparent, user controlled machinery for searching the Web over the Internet.

Screenshot

Project Spotlight

validator.php

A script to automatically validate HTML code generated by a PHP script.

Screenshot

Project Spotlight

Geany

A fast and lightweight IDE using GTK2.