Parallel K-Means Algorithm based on Hadoop-MapReduce for Mining
Details
This work investigates a parallel K-Means clustering algorithm, built on the MapReduce programming model, as a way to reduce the response time of data mining tasks. The algorithm's performance was evaluated in terms of SpeedUp and ScaleUp. Experiments ran on a Hadoop cluster of six computers with standard hardware, using 3, 4, and 6 machines. The clustered data are measurements from AmeriFlux flux towers in agricultural regions. Performance improved as machines were added: the best time was obtained with six machines, reaching a SpeedUp of 3.25. The application also scaled well when the data size and the number of machines in the cluster were increased proportionally, achieving similar performance across those tests.
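For readers unfamiliar with the approach, one K-Means iteration can be expressed as a map step and a reduce step. The sketch below is a single-process illustration in plain Python, not the book's Hadoop implementation: the function names (`map_point`, `reduce_cluster`, `kmeans_iteration`) are hypothetical, and the `speedup` and `scaleup` helpers assume the conventional definitions (baseline time divided by parallel time), which the description above does not spell out.

```python
from collections import defaultdict
from math import dist


def map_point(point, centroids):
    """Map step: emit (index of the nearest centroid, (point, 1))."""
    nearest = min(range(len(centroids)), key=lambda i: dist(point, centroids[i]))
    return nearest, (point, 1)


def reduce_cluster(values):
    """Reduce step: average all points that were assigned to one centroid."""
    total = [0.0] * len(values[0][0])
    count = 0
    for point, n in values:
        total = [t + x for t, x in zip(total, point)]
        count += n
    return tuple(t / count for t in total)


def kmeans_iteration(points, centroids):
    """Run one map/shuffle/reduce round and return the updated centroids."""
    groups = defaultdict(list)  # stands in for the shuffle phase
    for p in points:
        key, value = map_point(p, centroids)
        groups[key].append(value)
    # A centroid that received no points keeps its previous position.
    return [
        reduce_cluster(groups[i]) if i in groups else centroids[i]
        for i in range(len(centroids))
    ]


def speedup(baseline_time, parallel_time):
    """Assumed definition: same data, baseline time / time on more machines."""
    return baseline_time / parallel_time


def scaleup(base_time, scaled_time):
    """Assumed definition: base run time / run time with data and machines
    both scaled by the same factor. Values near 1.0 mean good scaling."""
    return base_time / scaled_time
```

In a real Hadoop job the map calls would run on separate machines over splits of the input, and the framework would perform the grouping that `defaultdict` stands in for here.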
About the author
She is currently a doctoral student in Computer Science at the Pontifical Catholic University of Paraná (PUCPR). She obtained a master's degree in Applied Computing from the State University of Ponta Grossa in 2015. She has a bachelor's degree in Systems Analysis and Development from the Federal Technological University of Paraná (2012).
Further information
- General information
- GTIN 09786209114083
- Language English
- Size H 220 mm x W 150 mm
- Year 2025
- EAN 9786209114083
- Format Paperback
- ISBN 978-620-9-11408-3
- Publication date 17.10.2025
- Title Parallel K-Means Algorithm based on Hadoop-MapReduce for Mining
- Author Lays Helena Lopes Veloso, Luciano José Senger
- Subtitle DE
- Publisher Our Knowledge Publishing
- Number of pages 56
- Genre Music