
Advanced Analytics with Spark: Patterns for Learning from Data at Scale by Sandy Ryza
(No ratings yet)
Key item features
Apache Spark is emerging as one of the most popular technologies for performing analytics on huge datasets, and this practical guide shows you how to harness Spark's power for approaching a variety of analytics problems. You'll learn how to apply common techniques, such as classification, clustering, collaborative filtering, anomaly detection, dimensionality reduction, and Monte Carlo simulation to fields such as genomics, security, and finance."Advanced Analytics with Spark" supplies complete implementations that analyze large public datasets, and acts as an introduction to using these techniques and other best practices in Spark programming.Become familiar with the Spark programming model and ecosystemLearn general approaches in data scienceDiscover which machine learning tools make sense for particular problemsAcquire code from GitHub that can be adapted to many usesThis book will interest both data science professionals and aspiring data scientists, students studying learning techniques for analyzing large datasets, and scientists interested in using Spark as a research tool., In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-classification, collaborative filtering, and anomaly detection among others-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications. Patterns include: Recommending music and the Audioscrobbler data set Predicting forest cover with decision trees Anomaly detection in network traffic with K-means clu...
Specs
- Book formatPaperback
- Fiction/nonfictionNon-Fiction
- GenreNonfiction
- Pages260
- Edition1st Edition
- Original languagesEnglish
Current price is USD$24.38
Price when purchased online
- Free shipping
Free 30-day returns
How do you want your item?
Columbus, 43215
Arrives between Apr 10 - Apr 13
|Sold and shipped by Alibris Books
4.564131668558456 stars out of 5, based on 10572 seller reviews(4.6)10572 seller reviews
Free 30-day returns
About this item
Product details
ADVANCED ANALYTICS WITH SPARK
Apache Spark is emerging as one of the most popular technologies for performing analytics on huge datasets, and this practical guide shows you how to harness Spark's power for approaching a variety of analytics problems. You'll learn how to apply common techniques, such as classification, clustering, collaborative filtering, anomaly detection, dimensionality reduction, and Monte Carlo simulation to fields such as genomics, security, and finance."Advanced Analytics with Spark" supplies complete implementations that analyze large public datasets, and acts as an introduction to using these techniques and other best practices in Spark programming.Become familiar with the Spark programming model and ecosystemLearn general approaches in data scienceDiscover which machine learning tools make sense for particular problemsAcquire code from GitHub that can be adapted to many usesThis book will interest both data science professionals and aspiring data scientists, students studying learning techniques for analyzing large datasets, and scientists interested in using Spark as a research tool., In this practical book, four Cloudera data scientists present a set of self-contained patterns for performing large-scale data analysis with Spark. The authors bring Spark, statistical methods, and real-world data sets together to teach you how to approach analytics problems by example. You'll start with an introduction to Spark and its ecosystem, and then dive into patterns that apply common techniques-classification, collaborative filtering, and anomaly detection among others-to fields such as genomics, security, and finance. If you have an entry-level understanding of machine learning and statistics, and you program in Java, Python, or Scala, you'll find these patterns useful for working on your own data applications. Patterns include: Recommending music and the Audioscrobbler data set Predicting forest cover with decision trees Anomaly detection in network traffic with K-means clu...
info:
We aim to show you accurate product information. Manufacturers, suppliers and others provide what you see here, and we have not verified it. Â
Specifications
Book format
Paperback
Fiction/nonfiction
Non-Fiction
Genre
Nonfiction
Pages
260
Warranty
Warranty information
Please be aware that the warranty terms on items offered for sale by third party Marketplace sellers may differ from those displayed in this section (if any). To confirm warranty terms on an item offered for sale by a third party Marketplace seller, please use the 'Contact seller' feature on the third party Marketplace seller's information page and request the item's warranty terms prior to purchase.
Similar items you might like
Based on what customers bought
Raise Your Frequency: Aligning with Higher Consciousness, (Paperback) $18.63
$1863current price $18.63Raise Your Frequency: Aligning with Higher Consciousness, (Paperback)
Diagnosing and Changing Organizational Culture: Based on the Competing Values Framework, (Paperback) $24.49
$2449current price $24.49Diagnosing and Changing Organizational Culture: Based on the Competing Values Framework, (Paperback)
Meaningful Graphs: Converting Data Into Informative Excel Charts, (Paperback) $16.49
$1649current price $16.49Meaningful Graphs: Converting Data Into Informative Excel Charts, (Paperback)
Becoming a Resonant Leader: Develop Your Emotional Intelligence, Renew Your Relationships, Sustain Your Effectiveness, (Paperback) $20.19 Was $25.19
$2019current price $20.19, Was $25.19$25.19Becoming a Resonant Leader: Develop Your Emotional Intelligence, Renew Your Relationships, Sustain Your Effectiveness, (Paperback)
44.8 out of 5 Stars. 4 reviewsLearning-Driven Business, The : How to Develop an Organizational Learning Ecosystem (Hardcover) $10.96
$1096current price $10.96Learning-Driven Business, The : How to Develop an Organizational Learning Ecosystem (Hardcover)
Pre-Owned Think STATS: Exploratory Data Analysis (Paperback) 1491907339 9781491907337 $9.74
2 optionsAvailable in additional 2 options$974current price $9.74Pre-Owned Think STATS: Exploratory Data Analysis (Paperback) 1491907339 9781491907337
Pre-Owned Learning Spark: Lightning-Fast Big Data Analysis (Paperback) 1449358624 9781449358624 $4.00
3 optionsAvailable in additional 3 options$400current price $4.00Pre-Owned Learning Spark: Lightning-Fast Big Data Analysis (Paperback) 1449358624 9781449358624
Pre-Owned Head First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions (Paperback) 0596153937 9780596153939 $5.98
2 optionsAvailable in additional 2 options$598current price $5.98Pre-Owned Head First Data Analysis: A Learner's Guide to Big Numbers, Statistics, and Good Decisions (Paperback) 0596153937 9780596153939
Pre-Owned Data Pipelines Pocket Reference: Moving and Processing Data for Analytics (Paperback) 1492087831 9781492087830 $11.99
3 optionsAvailable in additional 3 options$1199current price $11.99Pre-Owned Data Pipelines Pocket Reference: Moving and Processing Data for Analytics (Paperback) 1492087831 9781492087830
Data Mesh: Delivering Data-Driven Value at Scale, (Paperback) $53.02
$5302current price $53.02Data Mesh: Delivering Data-Driven Value at Scale, (Paperback)
Pre-Owned Spark: The Definitive Guide: Big Data Processing Made Simple (Paperback) 1491912219 9781491912218 $18.35 Was $27.39
3 optionsAvailable in additional 3 options$1835current price $18.35, Was $27.39$27.39Pre-Owned Spark: The Definitive Guide: Big Data Processing Made Simple (Paperback) 1491912219 9781491912218
Pre-Owned Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking (Paperback) 1449361323 9781449361327 $11.99 Was $14.99
3 optionsAvailable in additional 3 options$1199current price $11.99, Was $14.99$14.99Pre-Owned Data Science for Business: What You Need to Know about Data Mining and Data-Analytic Thinking (Paperback) 1449361323 9781449361327
Pre-Owned Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale (Paperback) 1491901632 9781491901632 $4.28
3 optionsAvailable in additional 3 options$428current price $4.28Pre-Owned Hadoop: The Definitive Guide: Storage and Analysis at Internet Scale (Paperback) 1491901632 9781491901632
Pre-Owned Basic Crocheting (Spiral-bound) 0811733165 9780811733168 $9.41
$941current price $9.41Pre-Owned Basic Crocheting (Spiral-bound) 0811733165 9780811733168
Boredom Busters: Transform Worksheets, Lectures, and Grading into Engaging, Meaningful Learning Experiences, (Paperback) $23.56
$2356current price $23.56Boredom Busters: Transform Worksheets, Lectures, and Grading into Engaging, Meaningful Learning Experiences, (Paperback)
Pre-Owned Building EventDriven Microservices: Leveraging Organizational Data at Scale Paperback $24.29
3 optionsAvailable in additional 3 options$2429current price $24.29Pre-Owned Building EventDriven Microservices: Leveraging Organizational Data at Scale Paperback
Pre-Owned Learning SQL: Generate, Manipulate, and Retrieve Data (Paperback) 1492057614 9781492057611 $24.99
3 optionsAvailable in additional 3 options$2499current price $24.99Pre-Owned Learning SQL: Generate, Manipulate, and Retrieve Data (Paperback) 1492057614 9781492057611
A Guide to the Project Management Body of Knowledge (PMBOK Guide)Fifth Edition $30.59 Was $34.58
$3059current price $30.59, Was $34.58$34.58A Guide to the Project Management Body of Knowledge (PMBOK Guide)Fifth Edition
134.4 out of 5 Stars. 13 reviewsThe Oak Island Code: Deciphering the origins and movements of the treasure with data science, (Hardcover) $23.80
$2380current price $23.80The Oak Island Code: Deciphering the origins and movements of the treasure with data science, (Hardcover)
Learning Pandas 2.0: A Comprehensive Guide to Data Manipulation and Analysis for Data Scientists and Machine Learning Pr, (Paperback) $46.93
$4693current price $46.93Learning Pandas 2.0: A Comprehensive Guide to Data Manipulation and Analysis for Data Scientists and Machine Learning Pr, (Paperback)
Customer ratings & reviews
0 ratings|0 reviews
This item does not have any reviews yet
