Release Notes: Phonetic searching, joints, named entities, geo-location, renderer, and more.
Release Notes: This release fixes a problem with double encoding when a URL contains whitespace or percent signs. The Tomcat package, Selenium, and Quartz libraries have been upgraded to their latest stable versions.
Release Notes: This release fixes a few issues from the previous release candidate. It is now possible to decide whether all the content of the RSS feed is indexed in only one document or if each item is indexed individually as one document in the index. The refresh action in the runtime/system tab panel no longer changes the selected tab. The failover mechanism in the parser is protected against infinite loops. In case of a parsing error, every kind of error is reported.
Release Notes: The second release candidate fixes a bug in RC0 which could crash the Java runtime while crawling websites with malformed cookies.
Release Notes: This release adds many user interface improvements and new features. A new PDF viewer is able to highlight searched keywords on PDF pages. This powerful new feature also supports scanned PDF documents (OCR). Visio and Publisher documents are supported, and RSS feeds are indexed. A new report module provides information on the most often-searched words and which words don't return a document. Information about users and groups are extracted while browsing CIFS/SMB and FTP repositories.
Release Notes: This release fixes an issue with the optimization task when it is launched by the Web crawler. It is now possible to make a replication between two instances running on non-identical operating systems (like Windows and Linux). Several HTML parser configurations can now be set up on one index. Crawlers can run forever or only once. The locking issue has been identified and fixed. The join query has been improved.
Release Notes: This release adds more than 20 new features, including phonetic indexation, SOAP Web service, geolocated search, YouTube, Dailymotion, and Vimeo indexation (using APIs), crawl cache on filesystem or Hadoop base, smaller memory footprint, and faster execution.
Release Notes: This release features many new enhancements, performance improvements, and bugfixes, progress info in the scheduler task, asynchronous XML upload, a default stop words list, autocomplete fields and analyzer, a scheduler task that copies terms to an index, advanced spell check, a regular expression filter for the database crawler, boost query, and new scored snippets.
Release Notes: This release has new features and bugfixes. The Web crawler stores a date telling when a page has been updated. It also stores a unique key to identify when pages are identical. The robots.txt check can be disabled. The database crawler is able to crawl URLs returned by an SQL query. The pattern list and the exclusion list can be disabled. It is now possible to boost the score of terms at indexation time. Queries are able to identify similar documents. CJK languages are supported.
Release Notes: More than 50 new features and bugfixes were added. An index can be replicated on a remote server. An n-grams filter and a shingle filter provide new possibilities, such as a suggestion box, wrong spelling tolerance, and automated topic generation. A database crawler supporting join queries and external files was added. Several improvements were made to the Web crawler, such as a URL exporter, sub-domain extraction, an exclusion list, manual crawling, and a parameter filter. An API and Web interface for monitoring and supervision was added. The new audio parser offers the ability to index torrent, MP3/MP4, OGG Vorbis, FLAC, and WMA files.