Back to results
Cover image for book Rapid - Apache Mahout Clustering designs

Rapid - Apache Mahout Clustering designs

By:Ashish Gupta
Publisher:Packt Publishing
Print ISBN:9781783284436
eText ISBN:9781783284443
Edition:1
Copyright:2015
Format:Reflowable

eBook Features

Instant Access

Purchase and read your book immediately

Read Offline

Access your eTextbook anytime and anywhere

Study Tools

Built-in study tools like highlights and more

Read Aloud

Listen and follow along as Bookshelf reads to you

Book Description

As more and more organizations are discovering the use of big data analytics, interest in platforms that provide storage, computation, and analytic capabilities has increased. Apache Mahout caters to this need and paves the way for the implementation of complex algorithms in the field of machine learning to better analyse your data and get useful insights into it. Starting with the introduction of clustering algorithms, this book provides an insight into Apache Mahout and different algorithms it uses for clustering data. It provides a general introduction of the algorithms, such as K-Means, Fuzzy K-Means, StreamingKMeans, and how to use Mahout to cluster your data using a particular algorithm. You will study the different types of clustering and learn how to use Apache Mahout with real world data sets to implement and evaluate your clusters. This book will discuss about cluster improvement and visualization using Mahout APIs and also explore model-based clustering and topic modelling using Dirichlet process. Finally, you will learn how to build and deploy a model for production use.

What you will learn

  • Explore clustering algorithms and cluster evaluation techniques
  • Learn different types of clustering and distance measuring techniques
  • Perform clustering on your data using KMeans clustering
  • Discover how canopy clustering is used as preprocess step for KMeans
  • Use the Fuzzy KMeans algorithm in Apache Mahout
  • Implement Streaming KMeans clustering in Mahout
  • Learn Spectral KMeans clustering implementation of Mahout

Who this book is for

• 2026 © SAU Tech Bookstore. All Rights Reserved.