Invited Lecture Series:
Metric and Kernel Learning
|
|
| Speaker: |
Dr. Inderjit S. Dhillon
|
| When: |
Friday, Nov 14th, 2008 |
| Time: |
2:00pm |
| Where: |
ECS 243 |
|
Abstract:
Many problems in data mining and machine learning, both in supervised and unsupervised learning, depend crucially on the choice of an appropriate distance or similarity measure. The appropriateness of such a measure can ultimately dictate the success or failure of the learning algorithm, but its choice is highly problem and application dependent. As a result, there have been several recent data-driven approaches that attempt to learn distance measures.
In this talk, I will present a new approach to metric and kernel learning using the Log-Determinant divergence. The Log-Determinant divergence has previously been used in statistics, where it is called Stein's Loss, and in numerical optimization, where it has been used to show superlinear convergence of the well-known BFGS quasi-Newton method. Our metric learning approach has the following desirable properties: (a) the metric learning problem is equivalent to a kernel learning problem, (b) the method can generalize to unseen data points, (c) the method can improve upon an input metric that may be provided by an application expert, and (d) the algorithm does not require any expensive eigenvector computation or semi-definite programming. I will present results on semi-supervised clustering, nearest neighbor error reporting for software programs, and image classification.
This is joint work with Jason Davis, Prateek Jain, Brian Kulis and Suvrit Sra.
Biography:
Inderjit Dhillon is an Associate Professor of Computer Sciences at The University of Texas at Austin. His main research interests are in numerical analysis, data mining and machine learning. He is best known for his work on computational algorithms in these areas, in particular on eigenvalue computations, clustering, co-clustering matrix approximations, and metric/kernel learning. Software based on his research on eigenvalue computations is now part of all state-of-the-art numerical software libraries. Inderjit received an NSF Career Award in 2001, a University Research Excellence Award in 2005, and the SIAG/LA Prize in 2006. Along with his students, he has received several best paper awards at leading data mining and machine learning conferences. Inderjit received his B.Tech. degree from the Indian Institute of Technology at Bombay, and Ph.D. from the University of California at Berkeley. He is a member of the Association for Computing Machinery, the Institute of Electrical and Electronics Engineers, and the Society for Industrial and Applied Mathematics.
|