Advances in Knowledge Discovery and Data Mining, Part I: by Mohammed J. Zaki, Jeffrey Xu Yu, B. Ravindran, Vikram Pudi

By Mohammed J. Zaki, Jeffrey Xu Yu, B. Ravindran, Vikram Pudi

This publication constitutes the court cases of the 14th Pacific-Asia convention, PAKDD 2010, held in Hyderabad, India, in June 2010.

Show description

Read Online or Download Advances in Knowledge Discovery and Data Mining, Part I: 14th Pacific-Asia Conference, PAKDD 2010, Hyderabat, India, June 21-24, 2010, Proceedings PDF

Similar data mining books

Data Mining in Agriculture (Springer Optimization and Its Applications)

Data Mining in Agriculture represents a entire attempt to supply graduate scholars and researchers with an analytical textual content on info mining options utilized to agriculture and environmental similar fields. This ebook offers either theoretical and sensible insights with a spotlight on proposing the context of every info mining method fairly intuitively with abundant concrete examples represented graphically and with algorithms written in MATLAB®.

Data Mining: Foundations and Practice

This ebook comprises helpful reviews in info mining from either foundational and sensible views. The foundational experiences of knowledge mining will help to put a pretty good beginning for info mining as a systematic self-discipline, whereas the sensible reports of knowledge mining could lead to new info mining paradigms and algorithms.

Big Data Analytics and Knowledge Discovery: 17th International Conference, DaWaK 2015, Valencia, Spain, September 1-4, 2015, Proceedings

This ebook constitutes the refereed complaints of the seventeenth foreign convention on facts Warehousing and information Discovery, DaWaK 2015, held in Valencia, Spain, September 2015. The 31 revised complete papers provided have been conscientiously reviewed and chosen from ninety submissions. The papers are prepared in topical sections similarity degree and clustering; facts mining; social computing; heterogeneos networks and knowledge; facts warehouses; circulation processing; purposes of massive info research; and massive information.

Understanding Complex Urban Systems: Integrating Multidisciplinary Data in Urban Models

This e-book is dedicated to the modeling and realizing of advanced city platforms. This moment quantity of realizing complicated city structures specializes in the demanding situations of the modeling instruments, pertaining to, e. g. , the standard and volume of information and the choice of an acceptable modeling process. it's intended to help city decision-makers—including municipal politicians, spatial planners, and citizen groups—in opting for a suitable modeling process for his or her specific modeling standards.

Extra info for Advances in Knowledge Discovery and Data Mining, Part I: 14th Pacific-Asia Conference, PAKDD 2010, Hyderabat, India, June 21-24, 2010, Proceedings

Example text

Naturally, the performance of these VAT-based methods is greatly dependent of the quality of the VAT images. However, while VAT has been widely used for cluster analysis, it is usually only effective at highlighting cluster tendency in data sets that contain compact well-separated clusters. Many practical applications involve data sets with highly irregular structure, which invalidate this assumption. In this paper, we propose an improved VAT (iVAT) approach to generating RDIs that combines VAT with a path-based distance transform.

2690, pp. 195–202. Springer, Heidelberg (2003) 4. : Some new indices of cluster validity. IEEE Transactions on System, Man and Cybernetics 28(3), 301–315 (1998) 5. : Estimating the number of clusters in a dataset via the gap statistics. Journal of the Royal Statistical Society. Series B, Statistical Methodology 63(2), 411–423 (2001) 6. : A dendrite method for cluster analysis. Communications in Statistics 3(1), 1–27 (1974) 7. : Indices of partition fuzziness and the detection of clusters in large sets.

4GHz CPU and 2GB memory running Windows XP. 1 Test Datasets Six synthetic data sets were used in our experiments, whose scatter plots are shown in Figure 4(a). These data sets involve irregular data structures, in which an obvious cluster centroid for each group is not necessarily available. , Iris, Vote and Multiple Features. The Face data set [26] contains 1755 images of 3 individuals, each of which was down-sampled to 30 × 40 pixels. The Gene data set [27] is a 194 × 194 matrix consisting of pairwise dissimilarities of a set of gene products from 3 protein families.

Download PDF sample

Rated 4.33 of 5 – based on 45 votes