Buying Options
Kindle Price: | $22.87 |
includes tax, if applicable |

Download the free Kindle app and start reading Kindle books instantly on your smartphone, tablet or computer – no Kindle device required. Learn more
Read instantly on your browser with Kindle Cloud Reader.
Using your mobile phone camera, scan the code below and download the Kindle app.

![Data Analytics with Hadoop: An Introduction for Data Scientists by [Benjamin Bengfort, Jenny Kim]](https://m.media-amazon.com/images/I/51lHM52JYWL._SX260_.jpg)
Follow the Authors
OK
Data Analytics with Hadoop: An Introduction for Data Scientists 1st Edition, Kindle Edition
Benjamin Bengfort (Author) Find all the books, read about the author, and more. See search results for this author |
Jenny Kim (Author) Find all the books, read about the author, and more. See search results for this author |
Amazon Price | New from | Used from |
Ready to use statistical and machine-learning techniques across large data sets? This practical guide shows you why the Hadoop ecosystem is perfect for the job. Instead of deployment, operations, or software development usually associated with distributed computing, you’ll focus on particular analyses you can build, the data warehousing techniques that Hadoop provides, and higher order data workflows this framework can produce.
Data scientists and analysts will learn how to perform a wide range of techniques, from writing MapReduce and Spark applications with Python to using advanced modeling and data management with Spark MLlib, Hive, and HBase. You’ll also learn about the analytical processes and data systems available to build and empower data products that can handle—and actually require—huge amounts of data.
- Understand core concepts behind Hadoop and cluster computing
- Use design patterns and parallel analytical algorithms to create distributed data analysis jobs
- Learn about data management, mining, and warehousing in a distributed context using Apache Hive and HBase
- Use Sqoop and Apache Flume to ingest data from relational databases
- Program complex Hadoop and Spark applications with Apache Pig and Spark DataFrames
- Perform machine learning techniques such as classification, clustering, and collaborative filtering with Spark’s MLlib
- ISBN-13978-1491913703
- Edition1st
- PublisherO'Reilly Media
- Publication date1 June 2016
- LanguageEnglish
- File size9218 KB
- Kindle (5th Generation)
- Kindle Keyboard
- Kindle DX
- Kindle (2nd Generation)
- Kindle (1st Generation)
- Kindle Paperwhite
- Kindle Paperwhite (5th Generation)
- Kindle Touch
Product description
About the Author
Product details
- ASIN : B01GGQKXO4
- Publisher : O'Reilly Media; 1st edition (1 June 2016)
- Language : English
- File size : 9218 KB
- Simultaneous device usage : Unlimited
- Text-to-Speech : Enabled
- Screen Reader : Supported
- Enhanced typesetting : Enabled
- X-Ray : Not Enabled
- Word Wise : Not Enabled
- Print length : 290 pages
- Best Sellers Rank: 1,009,145 in Kindle Store (See Top 100 in Kindle Store)
- 139 in Computer Programming Structured Design
- 179 in Java Computer Programming
- 505 in Java Programming
- Customer Reviews:
About the authors
Benjamin Bengfort is a Data Scientist who lives inside the beltway but ignores politics (the normal business of DC) favoring technology instead. He is currently working to finish his PhD at the University of Maryland where he studies machine learning and artificial intelligence. His lab does have robots (though this field of study is not one he favors) and, much to his chagrin, they seem to constantly arm said robots with knives and tools; presumably to pursue culinary accolades. Having seen a robot attempt to slice a tomato, Benjamin prefers his own adventures in the kitchen where he specializes in fusion French and Guyanese cuisine as well as BBQ of all types. A professional programmer by trade, a Data Scientist by avocation, Benjamin's writing pursues a diverse range of subjects from Natural Language Processing, to Data Science with Python to analytics with Hadoop.
Discover more of the author’s books, see similar authors, read author blogs, and more
Customers who bought this item also bought
Customer reviews
Top reviews from other countries


Do not recommend!

On the plus side, the best way to learn something is by doing and this book will give you plenty of opportunities to figure things out on your own. That's always a plus. But not for the authors.

