Transitivity Clustering
Details
Clustering is a computational technique for the assignment of objects into groups of similar elements. Generally, it is widely used for business data interpretation, natural language analyses, and image processing. Typical bioinformatic applications are the detection of homologous proteins and the identification of co-expressed genes. Here, we introduce Transitivity Clustering and its accompanying software framework TransClust, a homogeneous data partitioning method based on Weighted Transitive Graph Projection. It aims for unraveling hidden transitive substructures in a given similarity graph deduced from a pairwise similarity measure. Transitivity Clustering is an efficient technique that is capable of processing hundreds of thousands of data points while still being robust against outliers and noise. A single, intuitive density parameter determines the number and the size of the clusters; with provable attributes. In addition, we present extensions of the underlying graph model in order to create hierarchies and overlaps, as well as comparisons against alternative clustering approaches and real-world application cases.
Autorentext
Following his Diploma studies in mathemetics at Bielefeld University, Tobias Wittkop received in 2010 a PhD in bioinformatics from the Center for Biotechnology (CeBiTec) at Bielefeld University, Germany. Currently he works as postdoctoral researcher at the Buck Institute for Age Research in Novato, California, USA.
Weitere Informationen
- Allgemeine Informationen
- Sprache Englisch
- Herausgeber Südwestdeutscher Verlag für Hochschulschriften
- Gewicht 238g
- Untertitel Clustering biological data by unraveling hidden transitive substructures
- Autor Tobias Wittkop
- Titel Transitivity Clustering
- Veröffentlichung 19.04.2010
- ISBN 3838116542
- Format Kartonierter Einband
- EAN 9783838116549
- Jahr 2010
- Größe H220mm x B150mm x T9mm
- Anzahl Seiten 148
- GTIN 09783838116549