Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Big Data Made Easy
Details
Many corporations are finding that the size of their data sets are outgrowing the capability of their systems to store and process them. The data is becoming too big to manage and use with traditional tools. The solution: implementing a big data system.As Big Data Made Easy: A Working Guide to the Complete Hadoop Toolset shows, Apache Hadoop offers a scalable, fault-tolerant system for storing and processing data in parallel. It has a very rich toolset that allows for storage (Hadoop), configuration (YARN and ZooKeeper), collection (Nutch and Solr), processing (Storm, Pig, and Map Reduce), scheduling (Oozie), moving (Sqoop and Avro), monitoring (Chukwa, Ambari, and Hue), testing (Big Top), and analysis (Hive).The problem is that the Internet offers IT pros wading into big data many versions of the truth and some outright falsehoods born of ignorance. What is needed is a book just like this one: a wide-ranging but easily understood set of instructions to explain where to get Hadoop tools, what they can do, how to install them, how to configure them, how to integrate them, and how to use them successfully. And you need an expert who has worked in this area for a decadesomeone just like author and big data expert Mike Frampton.Big Data Made Easy approaches the problem of managing massive data sets from a systems perspective, and it explains the roles for each project (like architect and tester, for example) and shows how the Hadoop toolset can be used at each system stage. It explains, in an easily understood manner and through numerous examples, how to use each tool. The book also explains the sliding scale of tools available depending upon data size and when and how to use them. Big Data Made Easy shows developers and architects, as well as testers and project managers, how to:Store big dataConfigure big dataProcess big dataSchedule processesMove data among SQL and NoSQL systemsMonitor dataPerform big data analytics Report on big data processes and projectsTest big data systemsBig Data Made Easy also explains the best part, which is that this toolset is free. Anyone can download it andwith the help of this bookstart to use it within a day. With the skills this book will teach you under your belt, you will add value to your company or client immediately, not to mention your career.
Big Data Made Easy: A Working Guide to the Complete Hadoop Toolset is an introduction for developers and architects anyone else interested in big data to using the Apache Hadoop toolset. It includes a description of all tool capabilities as well as in-depth instructions to build and test a working system.
Autorentext
Mike Frampton has been in the IT industry since 1990, working in many roles (tester, developer, support, QA), and in many sectors ( telecoms, banking, energy, insurance). He has also worked for major corporations and banks, including IBM, HP, and JPMorgan Chase. The owner of Semtech Solutions, an IT/Big Data consultancy, Mike currently lives by the beach in Paraparaumu, New Zealand, with his wife and son.
Inhalt
The Problem with Data
Storing and Configuring Data with Hadoop, Yarn, and ZooKeeper
Collecting Data with Nutch and Solr
Processing Data Map Reduce
Scheduling Using Oozie
Moving Data with Sqoop and Avro
Monitoring the System with Chukwa, Ambari, and Hue
Analyzing and Querying Data with Hive and MongoDB
Reporting with Hadoop and Other Software
Testing with Big Top
Hadoop Present and Future
Weitere Informationen
- Allgemeine Informationen
- GTIN 09781484200957
- Sprache Englisch
- Auflage 1st edition
- Größe H235mm x B191mm x T22mm
- Jahr 2014
- EAN 9781484200957
- Format Kartonierter Einband
- ISBN 1484200950
- Veröffentlichung 24.12.2014
- Titel Big Data Made Easy
- Autor Michael Frampton
- Untertitel A Working Guide to the Complete Hadoop Toolset
- Gewicht 730g
- Herausgeber Apress
- Anzahl Seiten 392
- Lesemotiv Verstehen
- Genre Informatik