Wir verwenden Cookies und Analyse-Tools, um die Nutzerfreundlichkeit der Internet-Seite zu verbessern und für Marketingzwecke. Wenn Sie fortfahren, diese Seite zu verwenden, nehmen wir an, dass Sie damit einverstanden sind. Zur Datenschutzerklärung.
Mining For Significant Rare Events From Large Databases
Details
In this work, we present a novel algorithm for extracting valuable knowledge from large databases. Rare events are difficult to mine due to very little support they possess. Our algorithm, SARG (Significant Association Rule Generator) helps us to mine for significant patterns (including rare events) from large databases by defining the support fraction per cell in the contingency table instead of per the entire contingency table. It uses a combination of both support confidence and chi square statistic framework for mining significant patterns from vast raw data. In this algorithm, we introduce the notion of critical attribute and critical attribute value which are passed as input parameters to the SARG algorithm to make the mining process more selective. We ran our algorithm against a huge medical file provided by the Cleveland Clinic Foundation, Cleveland, OH. We compared the results of SARG algorithm with the results produced by Brin s chi square algorithm. Some of the results produced by SARG are unknown medical facts that are not produced by Brin's chi square algorithm.
Autorentext
Dr. Suman Katragadda has a dual masters degree in computer science and statistics and a doctoral degree in statistics. His area of specialization is on multivariate mixed data mining and fraud detection with applications to insurance, banking, stock trading, evidence based medicine, and market reseach.
Weitere Informationen
- Allgemeine Informationen
- GTIN 09783838340258
- Sprache Englisch
- Größe H220mm x B150mm x T8mm
- Jahr 2010
- EAN 9783838340258
- Format Kartonierter Einband
- ISBN 3838340256
- Veröffentlichung 21.01.2010
- Titel Mining For Significant Rare Events From Large Databases
- Autor Suman Katragadda
- Untertitel Discover the unknown facts
- Gewicht 203g
- Herausgeber LAP LAMBERT Academic Publishing
- Anzahl Seiten 124
- Genre Sozialwissenschaften, Recht & Wirtschaft