Hadoop In Practice V413HAVseeders: 21
leechers: 1
Hadoop In Practice V413HAV (Size: 53.04 MB)
Description
Hadoop In Practice
V413HAV For More Quality Uploads : Kickass Torrents : https://kat.ph/user/V413HAV/ V413HAV On Facebook E-Book On Amazon Support The Developers. If You Like It, Buy It. Formats: EPUB, PDF Book Description Hadoop in Practice collects 85 Hadoop examples and presents them in a problem/solution format. Each technique addresses a specific task you’ll face, like querying big data using Pig or writing a log file loader. You’ll explore each problem step by step, learning both how to build and deploy that specific solution along with the thinking that went into its design. As you work through the tasks, you’ll find yourself growing more comfortable with Hadoop and at home in the world of big data. Hadoop is an open source MapReduce platform designed to query and analyze data distributed across large clusters. Especially effective for big data systems, Hadoop powers mission-critical software at Apple, eBay, LinkedIn, Yahoo, and Facebook. It offers developers handy ways to store, manage, and analyze data. Hadoop in Practice collects 85 battle-tested examples and presents them in a problem/solution format. It balances conceptual foundations with practical recipes for key problem areas like data ingress and egress, serialization, and LZO compression. You’ll explore each technique step by step, learning how to build a specific solution along with the thinking that went into it. As a bonus, the book’s examples create a well-structured and understandable codebase you can tweak to meet your own needs. This book assumes the reader knows the basics of Hadoop. What’s Inside - Conceptual overview of Hadoop and MapReduce - 85 practical, tested techniques - Real problems, real solutions - How to integrate MapReduce and R Table of Contents Part 1: Background and Fundamentals Chapter 1. Hadoop in a heartbeat Part 2: Data Logistics Chapter 2. Moving data in and out of Hadoop Chapter 3. Data serialization—working with text and beyond Part 3: Big Data Patterns Chapter 4. Applying MapReduce patterns to big data Chapter 5. Streamlining HDFS for big data Chapter 6. Diagnosing and tuning performance problems Part 4: Data Science Chapter 7. Utilizing data structures and algorithms Chapter 8. Integrating R and Hadoop for statistics and more Chapter 9. Predictive analytics with Mahout Part 5: Taming the Elephant Chapter 10. Hacking with Hive Chapter 11. Programming pipelines with Pig Chapter 12. Crunch and other technologies Chapter 13. Testing and debugging Appendix A. Related technologies Appendix B. Hadoop built-in ingress and egress tools Appendix C. HDFS dissected Appendix D. Optimized MapReduce join frameworks Book Details Paperback: 536 pages Publisher: Manning Publications (October 2012) Language: English ISBN-10: 1617290238 ISBN-13: 978-1617290237 Sharing Widget |