About This Book
- Learn about Elasticsearch functionalities such as data indexing, data analysis, and dynamic mapping
- Fine-tune Elasticsearch, understand its metrics using its API and available tools, and see how it behaves in complex searches
- A hands-on tutorial that walks you through the features of Elasticsearch in an easy-to-understand way, with examples that will help you become an expert in no time
Who This Book Is For
If you are a web developer or a user who wants to learn more about Elasticsearch, then this is the book for you. You do not need to know anything about Elasticsearch, Java, or Apache Lucene in order to use this book, though basic knowledge of databases and queries is required.
What You Will Learn
- Configure and create your own index
- Set up an analysis chain and handle multilingual data
- Use the Elasticsearch query DSL to make all kinds of queries
- Utilize filters efficiently and ensure they do not affect performance
- Implement autocomplete functionality
- Employ faceting, the aggregations framework, and similar functionalities to get more from your search and improve your clients' search experience
- Monitor your cluster state and health by using Elasticsearch APIs as well as third-party monitoring solutions
- Learn what gateway and discovery modules are, and how to properly configure them
- Control primary shards and replica rebalancing
This book begins by introducing the most commonly used Elasticsearch server functionalities, from creating your own index structure, through querying, faceting, and aggregations, and ends with cluster monitoring and problem diagnosis. As you progress through the book, you will cover topics such as starting Elasticsearch, creating a new index, and designing its proper structure. After that, you'll read about the query API that Elasticsearch exposes, as well as its filtering capabilities, aggregations, and faceting. Last but not least, you will learn how to find similar documents and how to implement application alerts by using the prospective search functionality called percolator. Advanced topics such as shard allocation control, gateway configuration, and the use of the discovery module are also discussed. Finally, the book shows you the options for monitoring cluster state and health, including the use of third-party tools.
About the Author and Other Contributors
Rafal Kuc is a born team leader and software developer. He currently works as a consultant and a software engineer at Sematext Group, Inc., where he concentrates on open source technologies such as Apache Lucene and Solr, Elasticsearch, and the Hadoop stack. He has more than 12 years of experience in various branches of software, from banking software to ecommerce products. He focuses mainly on Java but is open to every tool and programming language that will make the achievement of his goal easier and faster. Rafal is also one of the founders of the solr.pl site, where he tries to share his knowledge and help people with the problems they face with Solr and Lucene. He has also been a speaker at various conferences around the world, such as Lucene Eurocon, Berlin Buzzwords, ApacheCon, and Lucene Revolution. Rafal began his journey with Lucene in 2002, and it wasn't love at first sight. When he came back to Lucene in late 2003, he revised his thoughts about the framework and saw the potential in search technologies. Then, Solr came along and this was it. He started working with Elasticsearch in the middle of 2010. Currently, Lucene, Solr, Elasticsearch, and information retrieval are his main points of interest. Rafal is the author of Apache Solr 3.1 Cookbook and its update, Apache Solr 4 Cookbook. He is also the author of the previous edition of this book and of Mastering ElasticSearch. All these books have been published by Packt Publishing.
Marek Rogozinski is a software architect and consultant with more than 10 years of experience. He specializes in solutions based on open source search engines such as Solr and Elasticsearch, and in the software stack for Big Data analytics, including Hadoop, HBase, and Twitter Storm. He is also the cofounder of the solr.pl site, which publishes information and tutorials about Solr and the Lucene library, and the coauthor of several books published by Packt Publishing. Currently, he holds the position of Chief Technology Officer in a new company, designing architecture for a set of products that collect, process, and analyze large streams of input data.