ICML 2009

ICML 2009 Proceedings Front Matter: icml09-frontmatter.pdf

ICML 2009 Proceedings Table of Contents: icml09-tableofcontents.pdf

ICML 2009 Bibtex file: icml2009.bib

ICML 2009 Proceedings

Information Theoretic Measures for Clusterings Comparison: Is a Correction for Chance Necessary?

Xuan Vinh Nguyen, Julien Epps and James Bailey

paper ID: 10

Information theoretic based measures form a fundamental class of similarity measures for comparing clusterings, beside the class of pair-counting based and set-matching based measures. In this paper, we discuss the necessity of correction for chance for information theoretic based measures for clusterings comparison. We observe that the baseline for such measures, i.e. average value under random partitioning of a data set, does not take on a constant value, and tends to have larger variation when the ratio between the number of data points and the number of clusters is small. This effect is similar in some other non-information theoretic based measures such as the well-known Rand Index. Assuming a hypergeometric model of randomness, we derive the analytical formula for the expected mutual information value between a pair of clusterings, and then propose the adjusted version for several popular information theoretic based measures. Some examples are given to demonstrate the need and usefulness of the adjusted measures.

For Participants

Programme

For authors

Organization

For students

Misc

ICML 2009 Proceedings

Information Theoretic Measures for Clusterings Comparison: Is a Correction for Chance Necessary?

A majorization-minimization algorithm for (multiple) hyperparameter learning

Robust Feature Extraction via Information Theoretic Learning

Probabilistic Dyadic Data Analysis with Local and Global Consistency

Identifying Suspicious URLs: An Application of Large-Scale Online Learning

An Efficient Sparse Metric Learning in High-Dimensional Space via $\ell_1$-Penalized Log-Determinant Regularization

Exploiting Sparse Markov and Covariance Structure in Multiresolution Models

Dynamic Analysis of Multiagent Q-learning with e-greedy Exploration

Sparse Higher Order Conditional Random Fields for improved sequence labeling

Efficient learning algorithms for changing environments

Ranking Interesting Subgroups

PAC-Bayesian Learning of Linear Classifiers

Approximate Inference for Planning in Stochastic Relational Worlds

Solution Stability in Linear Programming Relaxations: Graph Partitioning and Unsupervised Learning

Supervised Learning from Multiple Experts: Whom to trust when everyone lies a bit

Generalization Analysis of Listwise Learning-to-Rank Algorithms

Gradient Descent with Sparsification: an iterative algorithm for sparse recovery with restricted isometry property

Curriculum Learning

Efficient Euclidean Projections in Linear Time

Learning Dictionaries of Stable Autoregressive Models for Audio Scene Analysis

Non-Monotonic Feature Selection

EigenTransfer: A Unified Framework for Transfer Learning

Boosting with Structural Sparsity

More Generality in Efficient Multiple Kernel Learning

An Accelerated Gradient Method for Trace Norm Minimization

Accounting for Burstiness in Topic Models

Ranking with Ordered Weighted Pairwise Classification

Blockwise Coordinate Descent Procedures for the Multi-task Lasso, with Applications to Neural Semantic Basis Discovery

Polyhedral Outer Approximations with Application to Natural Language Parsing

Factored Conditional Restricted Boltzmann Machines for Modeling Motion Style

Discriminative $k$ metrics

A Novel Lexicalized HMM-based Learning Framework for Web Opinion Mining

Graph Construction and b-Matching for Semi-Supervised Learning

Matrix Updates for Perceptron Training of Continuous Density Hidden Markov Models

Geometry-aware Metric Learning

Prototype Vector Machine for Large Scale Semi-supervised Learning

Rule Learning with Monotonicity Constraints

Transfer Learning for Collaborative Filtering via a Rating-Matrix Generative Model

Proto-Predictive Representation of States with Simple Recurrent Temporal-Difference Networks

Large-scale Deep Unsupervised Learning using Graphics Processors

Deep Learning from Temporal Coherence in Video

Hoeffding and Bernstein Races for Selecting Policies in Evolutionary Direct Policy Search

Boosting products of base classifiers

Learning Instance Specific Distances Using Metric Propagation

Decision Tree and Instance-Based Learning for Label Ranking

Fast Evolutionary Maximum Margin Clustering

Structure Learning of Bayesian Networks using Constraints

Tractable Nonparametric Bayesian Inference in Poisson Processes with Gaussian Process Intensities

On Sampling-based Approximate Spectral Decomposition

Archipelago: Nonparametric Bayesian Semi-Supervised Learning

Good Learners for Evil Teachers

Stochastic Methods for L1 Regularized Loss Minimization

Nonparametric Factor Analysis with Beta Process Priors

Accelerated Gibbs Sampling for the Indian Buffet Process

Robot Trajectory Optimization using Approximate Inference

On Primal and Dual Sparsity of Markov Networks

Learning structurally consistent undirected probabilistic graphical models

Regression by dependence minimization and its application to causal inference

Learning Spectral Graph Transformations for Link Prediction

GAODE and HAODE: Two Proposals based on AODE to Deal with Continuous Variables

Sparse Gaussian Graphical Models with Unknown Block Structure

Robust Bounds for Classification via Selective Sampling

Optimistic Initialization and Greediness Lead to Polynomial Time Learning in Factored MDPs

Learning Nonlinear Dynamic Models

Convex Variational Bayesian Inference for Large Scale Generalized Linear Models

Unsupervised Search-based Structured Prediction

Large Margin Training for Hidden Markov Models with Partially Observed States

The Adaptive k-Meteorologists Problem and Its Application to Structure Learning and Feature Selection in Reinforcement Learning

Nonparametric Estimation of the Precision-Recall Curve

Trajectory Prediction: Learning to Map Situations to Robot Trajectories

Partially Supervised Feature Selection with Regularized Linear Models

A Least Squares Formulation for a Class of Generalized Eigenvalue Problems in Machine Learning

A Scalable Framework for Discovering Coherent Co-clusters in Noisy Data