Back to results
Cover image for book Elasticsearch for Hadoop

Elasticsearch for Hadoop

By:Vishal Shukla
Publisher:Packt Publishing
Print ISBN:9781785288999
eText ISBN:9781785282249
Edition:1
Copyright:2015
Format:Reflowable

eBook Features

Instant Access

Purchase and read your book immediately

Read Offline

Access your eTextbook anytime and anywhere

Study Tools

Built-in study tools like highlights and more

Read Aloud

Listen and follow along as Bookshelf reads to you

Integrate Elasticsearch into Hadoop to effectively visualize and analyze your data

Key Features

    Book Description

    The Hadoop ecosystem is a de-facto standard for processing terra-bytes and peta-bytes of data. Lucene-enabled Elasticsearch is becoming an industry standard for its full-text search and aggregation capabilities. Elasticsearch-Hadoop serves as a perfect tool to bridge the worlds of Elasticsearch and Hadoop ecosystem to get best out of both the worlds. Powered with Kibana, this stack makes it a cakewalk to get surprising insights out of your massive amount of Hadoop ecosystem in a flash. In this book, you'll learn to use Elasticsearch, Kibana and Elasticsearch-Hadoop effectively to analyze and understand your HDFS and streaming data. You begin with an in-depth understanding of the Hadoop, Elasticsearch, Marvel, and Kibana setup. Right after this, you will learn to successfully import Hadoop data into Elasticsearch by writing MapReduce job in a real-world example. This is then followed by a comprehensive look at Elasticsearch essentials, such as full-text search analysis, queries, filters and aggregations; after which you gain an understanding of creating various visualizations and interactive dashboard using Kibana. Classifying your real-world streaming data and identifying trends in it using Storm and Elasticsearch are some of the other topics that we'll cover. You will also gain an insight about key concepts of Elasticsearch and Elasticsearch-hadoop in distributed mode, advanced configurations along with some common configuration presets you may need for your production deployments. You will have “Go production checklist” and high-level view for cluster administration for post-production. Towards the end, you will learn to integrate Elasticsearch with other Hadoop eco-system tools, such as Pig, Hive and Spark.

    What you will learn

    • Set up the ElasticsearchHadoop environment
    • Import HDFS data into Elasticsearch with MapReduce jobs
    • Perform fulltext search and aggregations efficiently using Elasticsearch
    • Visualize data and create interactive dashboards using Kibana
    • Check and detect anomalies in streaming data using Storm and Elasticsearch
    • Inject and classify realtime streaming data into Elasticsearch
    • Get productionready for ElasticsearchHadoop based projects
    • Integrate with Hadoop ecosystem such as Pig, Storm, Hive, and Spark

    Who this book is for

    This book is targeted at Java developers with basic knowledge on Hadoop. No prior Elasticsearch experience is expected.

    • 2026 © SAU Tech Bookstore. All Rights Reserved.