Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Practical Hadoop Ecosystem
Details
A unique more in-depth practical book on Hadoop's ecosystem to marketHadoop and Big Data are important topics to today's programmers, developmers and database admins.
In-depth book covering topics that are not covered elsewhere, and how they all work together Provides practical examples Presents one of the two most popular big data frameworks, Hadoop
Autorentext
Deepak Vohra is a coder, developer, programmer, book author, and technical reviewer.
Klappentext
This book is a practical guide on using the Apache Hadoop projects including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout and Apache Solr. From setting up the environment to running sample applications each chapter is a practical tutorial on using a Apache Hadoop ecosystem project. While several books on Apache Hadoop are available, most are based on the main projects MapReduce and HDFS and none discusses the other Apache Hadoop ecosystem projects and how these all work together as a cohesive big data development platform.
What you'll learn
How to set up environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5.
How to run a MapReduce job
How to store data with Apache Hive, Apache HBase
How to index data in HDFS with Apache Solr
How to develop a Kafka messaging system
How to develop a Mahout User Recommender SystemHow to stream Logs to HDFS with Apache Flume
How to transfer data from MySQL database to Hive, HDFS and HBase with Sqoop
How create a Hive table over Apache Solr
Inhalt
Part I. Fundamentals.- Introduction.- 1. HDFS and MapReduce.- Part II Storing & Querying.- 2. Apache Hive.- 3. Apache HBase.- Part III Bulk Transferring & Streaming.- 4. Apache Sqoop.- 5. Apache Flume.- Part IV Serializing.- 6. Apache Avro.- 7. Apache Parquet.- Part V Messaging & Indexing.- 8. Apache Kafka.- 9. Apache Solr.- 10.Apache Mahout.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09781484221983
- Genre Information Technology
- Auflage 1st edition
- Lesemotiv Verstehen
- Anzahl Seiten 444
- Größe H254mm x B178mm x T24mm
- Jahr 2016
- EAN 9781484221983
- Format Kartonierter Einband
- ISBN 1484221982
- Veröffentlichung 01.10.2016
- Titel Practical Hadoop Ecosystem
- Autor Deepak Vohra
- Untertitel A Definitive Guide to Hadoop-Related Frameworks and Tools
- Gewicht 829g
- Herausgeber Apress
- Sprache Englisch