
Probabilistic models of vision and max-margin methods

Abstract: It is attractive to formulate problems in computer vision and related fields in terms of probabilistic estimation, where the probability models are defined over graphs, such as grammars. The graphical structures, and the state variables defined over them, give a rich knowledge representation which can describe the complex structures of objects and images. The probability distributions defined over the graphs capture the statistical variability of these structures. These probability models can be learnt from training data with limited amounts of supervision. But learning these models suffers from the difficulty of evaluating the normalization constant, or partition function, of the probability distributions, which can be extremely computationally demanding. This paper shows that by placing bounds on the normalization constant we can obtain computationally tractable approximations. Surprisingly, for certain choices of loss functions, we obtain many of the standard max-margin criteria used in support vector machines (SVMs), and hence we reduce the learning to standard machine learning methods. We show that many machine learning methods can be obtained in this way as approximations to probabilistic methods, including multi-class max-margin, ordinal regression, max-margin Markov networks and parsers, multiple-instance learning, and latent SVM. We illustrate this work by computer vision applications including image labeling, object detection and localization, and motion estimation. We speculate that better results can be obtained by using better bounds and approximations.
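The link between bounding the normalization constant and max-margin criteria can be illustrated with a small numerical sketch. This is not the paper's derivation, only a minimal example under stated assumptions: a log-linear model whose per-label scores stand in for w · φ(x, y), a task loss added inside the partition sum, and the elementary inequality max ≤ log-sum-exp ≤ max + log K for K labels. All function names and the example scores are hypothetical.

```python
import math

def log_sum_exp(values):
    """Numerically stable log(sum(exp(v) for v in values))."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def loss_augmented_nll(scores, true_label, loss):
    """Negative log-likelihood with the task loss added inside the partition sum.

    scores[y] stands in for w . phi(x, y) (an assumption of this sketch).
    """
    augmented = [s + loss(y, true_label) for y, s in enumerate(scores)]
    return log_sum_exp(augmented) - scores[true_label]

def multiclass_hinge(scores, true_label, loss):
    """Max-margin criterion: the log-sum-exp is replaced by its largest term."""
    augmented = [s + loss(y, true_label) for y, s in enumerate(scores)]
    return max(augmented) - scores[true_label]

def zero_one(y, y_true):
    """0/1 task loss."""
    return 0.0 if y == y_true else 1.0

scores = [2.0, 0.5, -1.0]  # hypothetical scores for three labels
soft = loss_augmented_nll(scores, 0, zero_one)
hinge = multiclass_hinge(scores, 0, zero_one)

# The hinge value is sandwiched: hinge <= soft <= hinge + log(number of labels).
print(hinge, soft, hinge + math.log(len(scores)))
```

In this sketch the hinge term avoids summing over all labels, which is the expensive part when the label space is a large structured set; the price is a gap of at most log K relative to the probabilistic criterion.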
Source: Frontiers of Electrical and Electronic Engineering in China (中国电气与电子工程前沿, English edition), CSCD, 2012, No. 1, pp. 94-106 (13 pages)
Keywords: structured prediction, max-margin learning, probabilistic models, loss function







