12 projects tagged "Indexing/Search"
DiggerDiff is a simple program to help you keep an eye out for changes in Web pages you are interested in. It fetches each page in its list and compares the contents against an older copy of that page stored locally and displays the results. If a page has changed, a link is given to display the differences between the old and new versions so you can easily spot the changes. You can also specify multiple search and replace expressions to apply to the local copies of each page. This can be useful to minimize changed results for pages that often have predictable, small changes that are not important, and to clean up the display of the differences.
Krang SiteMap is a Krang add-on for generating robot-friendly sitemaps to aid in search engine optimization. It is designed to be easily integrated into existing Krang element libraries with minimal work. Creating and publishing a SiteMap story at the root category of a site will result in a minimally formatted story listing every published story on the site.
Sharehound is a network file systems indexer and Google-like searcher written in Java and based on Apache Lucene. It currently supports SMB file shares (i.e. MS Windows-based network shares) and FTP resources. A Web UI is used for search and crawl monitoring. An RSS feed is provided for search results that include deleted files info.
Document Library is a Web application for document management in larger organizations with a lot of documents. It supports a workflow for the submission and publication of documents. It also supports the automatic conversion of document formats on the server, such as Word to PDF. All metadata about documents can be harvested using the standard OAI-PMH protocol, thus allowing the integration of Document Library with other applications, such as Web CMSs.
RestfulBeans is a high level API for sharing objects between a server and a Java-based client. The underlying protocol is designed to expose the data consumed by a rich client to the search engines. The key properties of the protocol are compliance with the HTTP spec and REST principles, and XML representation for Java Beans with object references represented as XLink (to allow search engines to walk through the object tree). The key API property is the ability to work with remote objects as if they were local. An application using RestfulBeans is automatically capable of supplying users with URLs for any “content item” it deals with.