Pro Microsoft HDInsight
Details
Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure Platform. The information in this book enables you to process enormous volumes of structured and unstructured data easily using HDInsight, Microsoft's own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) offerings available through Windows Azure lets you take advantage of Hadoop's processing power without the worry of creating, configuring, maintaining, or managing your own cluster.
With the coming data explosion, the open source Apache Hadoop framework is gaining traction, and it benefits from a huge ecosystem that has grown up around the core functionality of the Hadoop Distributed File System (HDFS™) and Hadoop MapReduce. Pro Microsoft HDInsight equips you with the knowledge, confidence, and technique to configure and manage this ecosystem on Windows Azure. The book is an excellent choice for anyone aspiring to be a data scientist or data engineer, putting you a step ahead in the data mining field.
- Guides you through installation and configuration of an HDInsight cluster on Windows Azure
- Provides clear examples of configuring and executing MapReduce jobs
- Helps you consume data and diagnose errors from the Windows Azure HDInsight Service
About the author
Debarchan Sarkar is a SQL Server engineer from the City of Joy, Calcutta, India, now based in Bangalore, India. His passion for processing and architecting data grew with the Business Intelligence suite introduced in SQL Server 2005. Since then he has fallen in love with the diversity of data mining techniques. He is a very active community member in the BI domain and has published MSDN articles, white papers, blogs, and videos in the ETL space. Debarchan has also delivered advanced training on ETL and data warehousing with great success to several Microsoft Premium partners and clients. He has been working on HDInsight since the initial days of its internal releases, helping to test and improve the product in multiple dimensions.
Contents
- Introducing HDInsight
- Understanding Windows Azure HDInsight Service
- Provisioning Your HDInsight Service Cluster
- Automating HDInsight Cluster Provisioning
- Submitting Jobs to Your HDInsight Cluster
- Exploring the HDInsight Name Node
- Using Windows Azure HDInsight Emulator
- Accessing HDInsight over Hive and ODBC
- Consuming HDInsight from Self-Service BI Tools
- Integrating HDInsight with SQL Server Integration Services
- Logging in HDInsight
- Troubleshooting Cluster Deployments
- Troubleshooting Job Failures
Further information
- General information
- GTIN 09781430260554
- Language English
- Edition 1st edition
- Dimensions H 235 mm x W 191 mm x D 15 mm
- Year 2014
- EAN 9781430260554
- Format Paperback
- ISBN 1430260556
- Publication date 21 February 2014
- Title Pro Microsoft HDInsight
- Author Debarchan Sarkar
- Subtitle Hadoop on Windows
- Weight 514 g
- Publisher Apress
- Number of pages 272
- Reading motive Understanding
- Genre Computer science