
Hadoop in Practice : Includes 104 Techniques (Edition 2) (Paperback)
Key item features
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Book
It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.
Readers need to know a programming language like Java and have basic familiarity with Hadoop.
What's Inside
- Thoroughly updated for Hadoop 2
- How to write YARN applications
- Integrate real-time technologies like Storm, Impala, and Spark
- Predictive analytics using Mahout and RR
- Readers need to know a programming language like Java and have basic familiarity with Hadoop.
About the Author
Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.
Table of Contents
- Hadoop in a heartbeat
- Introduction to YARN
- Data serialization—working with text and beyond
- Organizing and optimizing data in HDFS
- Moving data into and out of Hadoop
- Applying MapReduce patterns to big data
- Utilizing data structures and algorithms at scale
- Tuning, debugging, and testing
- SQL on Hadoop
- Writing a YARN application
PART 1 BACKGROUND AND FUNDAMENTALS
PART 2 DATA LOGISTICS
PART 3 BIG DATA PATTERNS
PART 4 BEYOND MAPREDUCE
Specs
- Book formatPaperback
- Fiction/nonfictionNon-Fiction
- GenreComputing & Internet
- Publication dateFebruary, 2015
- Pages512
- Edition2
- Free shipping
Free 30-day returns
How do you want your item?
More seller options (1)
About this item
Product details
Major updates to key technologies
AUDIENCE
Readers should be familiar with Hadoop and have experience programming in Java or another OOP language.
For developers working with big data, it's not enough to have a theoretical understanding of Hadoop. They need to solve real challenges like analyzing real-time streams, moving data securely between storage systems, and managing large-scale clusters. The Hadoop ecosystem is constantly growing, and it's important they keep up with the new technologies and practices to stay productive and future-proof data systems.
Hadoop in Practice, Second Edition provides over 100 tested, instantly-useful techniques that will help conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN, real-time use cases, and integrating Kafka, Storm, and Spark with Hadoop. There's also a new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere. RETAIL SELLING POINTS Practical up-to-date coverage Over 100 practical, battle-tested Hadoop techniquesMajor updates to key technologies
AUDIENCE
Readers should be familiar with Hadoop and have experience programming in Java or another OOP language.
ABOUT THE TECHNOLOGY
Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers organizations efficient ways to store, manage, and analyze data.
Hadoop in Practice, Second Edition provides over 100 tested, instantly useful techniques that will help you conquer big data, using Hadoop. This revised new edition covers changes and new features in the Hadoop core architecture, including MapReduce 2. Brand new chapters cover YARN and integrating Kafka, Impala, and Spark SQL with Hadoop. You'll also get new and updated techniques for Flume, Sqoop, and Mahout, all of which have seen major new versions recently. In short, this is the most practical, up-to-date coverage of Hadoop available anywhere.
Purchase of the print book includes a free eBook in PDF, Kindle, and ePub formats from Manning Publications.
About the Book
It's always a good time to upgrade your Hadoop skills! Hadoop in Practice, Second Edition provides a collection of 104 tested, instantly useful techniques for analyzing real-time streams, moving data securely, machine learning, managing large-scale clusters, and taming big data using Hadoop. This completely revised edition covers changes and new features in Hadoop core, including MapReduce 2 and YARN. You'll pick up hands-on best practices for integrating Spark, Kafka, and Impala with Hadoop, and get new and updated techniques for the latest versions of Flume, Sqoop, and Mahout. In short, this is the most practical, up-to-date coverage of Hadoop available.
Readers need to know a programming language like Java and have basic familiarity with Hadoop.
What's Inside
- Thoroughly updated for Hadoop 2
- How to write YARN applications
- Integrate real-time technologies like Storm, Impala, and Spark
- Predictive analytics using Mahout and RR
- Readers need to know a programming language like Java and have basic familiarity with Hadoop.
About the Author
Alex Holmes works on tough big-data problems. He is a software engineer, author, speaker, and blogger specializing in large-scale Hadoop projects.
Table of Contents
- Hadoop in a heartbeat
- Introduction to YARN
- Data serialization—working with text and beyond
- Organizing and optimizing data in HDFS
- Moving data into and out of Hadoop
- Applying MapReduce patterns to big data
- Utilizing data structures and algorithms at scale
- Tuning, debugging, and testing
- SQL on Hadoop
- Writing a YARN application
PART 1 BACKGROUND AND FUNDAMENTALS
PART 2 DATA LOGISTICS
PART 3 BIG DATA PATTERNS
PART 4 BEYOND MAPREDUCE
Specifications
Book format
Fiction/nonfiction
Genre
Publication date
Warranty
Warranty information
Similar items you might like
Based on what customers bought
Guide to Programming and Algorithms Using R, (Paperback) $49.87
$4987current price $49.87Guide to Programming and Algorithms Using R, (Paperback)
Breaking into AI: The Ultimate Interview Playbook, (Paperback) $49.99
$4999current price $49.99Breaking into AI: The Ultimate Interview Playbook, (Paperback)
The Parinama Method: Transform Everything - A Practical and Philosophical Guide, (Paperback) $47.39
$4739current price $47.39The Parinama Method: Transform Everything - A Practical and Philosophical Guide, (Paperback)
The ID CaseBook: Case Studies in Instructional Design, (Paperback) $48.96
$4896current price $48.96The ID CaseBook: Case Studies in Instructional Design, (Paperback)
Dynamodb Applied Design Patterns, (Paperback) $48.29
$4829current price $48.29Dynamodb Applied Design Patterns, (Paperback)
Professionnaliser La Formation Des Professeurs de Fle (Paperback) $59.06
$5906current price $59.06Professionnaliser La Formation Des Professeurs de Fle (Paperback)
Psychological Digital Practice: The Basics and Beyond, (Paperback) $26.15
$2615current price $26.15Psychological Digital Practice: The Basics and Beyond, (Paperback)
Hadoop Blueprints, (Paperback) $48.29
$4829current price $48.29Hadoop Blueprints, (Paperback)
Pattern Recognition: Ideas in Practice, (Paperback) $54.99
$5499current price $54.99Pattern Recognition: Ideas in Practice, (Paperback)
Déduplication efficace des données dans Hadoop, (Paperback) $47.00
$4700current price $47.00Déduplication efficace des données dans Hadoop, (Paperback)
Mastering Cocos2d Game Development, (Paperback) $48.29
$4829current price $48.29Mastering Cocos2d Game Development, (Paperback)
Practical Hadoop Security, (Paperback) $49.23
$4923current price $49.23Practical Hadoop Security, (Paperback)
Cryptography for Everyone, (Paperback) $49.99
$4999current price $49.99Cryptography for Everyone, (Paperback)
Business Research: A Practical Guide for Students, (Paperback) $51.80 Was $65.99
$5180current price $51.80, Was $65.99$65.99Business Research: A Practical Guide for Students, (Paperback)
Machine Learning: Theory and Practice, (Paperback) $47.99
$4799current price $47.99Machine Learning: Theory and Practice, (Paperback)
Deep Learning in Practice, (Paperback) $58.02
$5802current price $58.02Deep Learning in Practice, (Paperback)
Mastering PostCSS for Web Design, (Paperback) $48.29
$4829current price $48.29Mastering PostCSS for Web Design, (Paperback)
IoT Fundamentals with a Practical Approach, (Paperback) $51.19 Was $59.95
$5119current price $51.19, Was $59.95$59.95IoT Fundamentals with a Practical Approach, (Paperback)


