
Spark in Action (Edition 1) (Paperback)
Book Format: Paperback - Out of stock
Key item features
Summary
Spark in Action teaches you the theory and skills you need to effectively handle batch and streaming data using Spark. Fully updated for Spark 2.0.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Technology
Big data systems distribute datasets across clusters of machines, making it a challenge to efficiently query, stream, and interpret them. Spark can help. It is a processing system designed specifically for distributed data. It provides easy-to-use interfaces, along with the performance you need for production-quality analytics and machine learning. Spark 2 also adds improved programming APIs, better performance, and countless other upgrades.
About the Book
Spark in Action teaches you the theory and skills you need to effectively handle batch and streaming data using Spark. You'll get comfortable with the Spark CLI as you work through a few introductory examples. Then, you'll start programming Spark using its core APIs. Along the way, you'll work with structured data using Spark SQL, process near-real-time streaming data, apply machine learning algorithms, and munge graph data using Spark GraphX. For a zero-effort startup, you can download the preconfigured virtual machine ready for you to try the book's code.
What's Inside
- Updated for Spark 2.0
- Real-life case studies
- Spark DevOps with Docker
- Examples in Scala, and online in Java and Python
About the Reader
Written for experienced programmers with some background in big data or machine learning.
About the Authors
Petar Zečević and Marko Bonaći are seasoned developers heavily involved in the Spark community.
Table of Contents
PART 1 - FIRST STEPS
- Introduction to Apache Spark
- Spark fundamentals
- Writing Spark applications
- The Spark API in depth
PART 2 - MEET THE SPARK FAMILY
- Sparkling queries with Spark SQL
- Ingesting data with Spark Streaming
- Getting smart with MLlib
- ML: classification and clustering
- Connecting the dots with GraphX
PART 3 - SPARK OPS
- Running Spark
- Running on a Spark standalone cluster
- Running on YARN and Mesos
PART 4 - BRINGING IT TOGETHER
- Case study: real-time dashboard
- Deep learning on Spark with H2O
Specs
- Book format: Paperback
- Fiction/nonfiction: Non-Fiction
- Genre: Computing & Internet
- Publication date: November, 2016
- Pages: 472
- Number in series: 1
Current price is USD$67.70
Price when purchased online
Out of stock
Similar items you might like
Based on what customers bought
Ehrlichkeit in Der Budgetierung (Paperback) $63.74
Ausschreibungshilfe Rohbau: Standardleistungsbeschreibungen -- Baupreise -- Firmenverzeichnis (Paperback) $69.40
Annotationes (Paperback) $52.76
Berechnung Mechanischer Schwingungen (Paperback) $67.40
Formação de competência comunicativa (Paperback) $71.00
CT Lab Book (Paperback) $77.49
Parasitas Helmintos Veterinários: Revisão Concisa (Paperback) $61.84
Nódulos da tiroide (Paperback) $49.00
Glândula uropigial e uropigialectomia (Paperback) $58.00
Eseje na temat plci w pokoju i konflikcie (Paperback) $54.00
Gärungschemisches Praktikum (Paperback) $59.99
Lunge Und Arbeitswelt (Paperback) $76.25
Elementarmathematik Griffbereit: Definitionen, Theoreme, Beispiele (Paperback) $55.98 (was $69.99)
Pharmacognosy and Phytochemistry: Practical Book (Paperback) $46.00
Perfil biológico de compostos heterocíclicos (Paperback) $48.00
Vitalmikroskopie der Haut im Auflicht (Paperback) $69.95
Informationstechnologie und Schulverwaltung (Paperback) $46.00
Grundstudium Literaturwissenschaft Stilistik, Book 5 (Paperback) $60.84
Philosophische Anfangsgründe der Quantenphysik (Paperback) $64.61
Höhere Mathematik Griffbereit: Definitionen Theoreme Beispiele (Paperback) $72.53
About this item
Product details
ISBN: 9781617292606. New condition. Trade paperback (US). Language: English. Pages: 472. Glued binding.
Working with big data can be complex and challenging, in part because of the multiple analysis frameworks and tools required. Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. But Spark goes much further than other frameworks. By including machine learning and graph processing capabilities, it makes many specialized data processing platforms obsolete. Spark's unified framework and programming model significantly lower the initial infrastructure investment, and Spark's core abstractions are intuitive for most Scala, Java, and Python developers.
Spark in Action teaches readers to use Spark for stream and batch data processing. It starts with an introduction to the Spark architecture and ecosystem, followed by a taste of Spark's command-line interface. Readers then discover the most fundamental concepts and abstractions of Spark, particularly Resilient Distributed Datasets (RDDs) and the basic data transformations that RDDs provide. The first part of the book covers writing Spark applications using the core APIs. Readers also learn how to work with structured data using Spark SQL, how to process near-real-time data with Spark Streaming, how to apply machine learning algorithms with Spark MLlib, and how to apply graph algorithms to graph-shaped data using Spark GraphX. The book also includes an introduction to Spark clustering.
Key Features
- Clear introduction to Spark
- Teaches how to ingest near-real-time data
- Gaining value from big data
- Includes real-life case studies
Audience
Readers should be familiar with Java, Scala, or Python. No knowledge of Spark or streaming operations is assumed, but some acquaintance with machine learning is helpful.
About the Technology
Apache Spark is a big data processing framework perfect for analyzing near-real-time streams and discovering historical patterns in batched data sets. Spark also offers machine learning and graph processing capabilities.
