Spark in Action

4.27 (11 ratings by Goodreads)
By (author) Petar Zecevic, Marko Bonaci


Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework well suited to analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By including machine learning and graph processing capabilities, it makes many specialized data processing platforms obsolete. Spark's unified framework and programming model significantly lower the initial infrastructure investment, and Spark's core abstractions are intuitive for most Scala, Java, and Python developers.

Spark in Action teaches readers to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem, followed by a taste of Spark's command line interface. Readers then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book covers writing Spark applications using the core APIs. Readers also learn how to work with structured data using Spark SQL, how to process near-real-time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, and how to apply graph algorithms to graph-shaped data using Spark GraphX, and they receive an introduction to Spark clustering.

KEY FEATURES

  • Clear introduction to Spark
  • Teaches how to ingest near-real-time data
  • Shows how to gain value from big data
  • Includes real-life case studies

AUDIENCE
Readers should be familiar with Java, Scala, or Python. No knowledge of Spark or streaming operations is assumed, but some acquaintance with machine learning is helpful.

ABOUT THE TECHNOLOGY
Apache Spark is a big data processing framework well suited to analyzing near-real-time streams and discovering historical patterns in batched data sets. Spark also offers machine learning and graph processing capabilities.

Product details

  • Paperback | 468 pages
  • 188 x 234 x 22.86mm | 796g
  • Manning Publications
  • New York, United States
  • English
  • ISBN-10: 1617292605
  • ISBN-13: 9781617292606
  • 344,535

About Petar Zecevic

Petar Zecevic is a CTO at SV Group. During the last 14 years he has worked on various projects as a Java developer, team leader, consultant, and software specialist. He is the founder and, with Marko, organizer of the popular Spark@Zg meetup group.

Marko Bonaci has worked with Java for 13 years. He works at Sematext as a Spark developer and consultant. Before that, he was team lead for SV Group's IBM Enterprise Content Management team.

Rating details

11 ratings
4.27 out of 5 stars
5 55% (6)
4 27% (3)
3 9% (1)
2 9% (1)
1 0% (0)
Book ratings by Goodreads