Apache Lucene 5 - New Features and Improvements for Apache Solr and Elasticsearch

Search
06/02/2015 - 16:30 to 17:10
Stage 2
long talk (40 min)
Intermediate

Session abstract: 

This talk will present the improvements and new features, but also some incompatible changes in the Lucene 5 release:

Lucene 5 will focus on data safety: The move to Java 7 was completed. Lucene now uses all the brand new features (NIO.2) of Java 7 to make the indexing process more stable and resulting indexes durable. Checksums are used during merging to prevent bugs in the underlying JVM or data corruption due to networking errors (e.g., while distributing indexes during recovery in Elasticsearch) to persist in newly created index segments.

The previous major version, Lucene 4 was a major release that introduced index codecs. In Lucene 5, the API around codecs will be cleaned up and will likely get more stable. There are also new features, like a common FilterCache that can be reused by Solr and Elasticsearch.

In parallel Apache Solr 5 was released, the first version that will now work as a server out of the box, so Solr is no longer exposed as a webapp. Init.d scripts are included and configuration is managed easier through Zookeeper.

This talk will give an overview over the background of these changes and how to make the best out of it.

Video: 

Slide: 

Corporate-Design: Extragestaltung, Margarethe Hausstätter
Ilustration: cyan, Berlin