Materials: [ Cód.: BOIntroOptimalSample.mlx ] [ PDF ]
This video plots the (approximate) distribution of global minima of the posterior of a function modeled as a Gaussian process of which three samples have been observed.
First, the video reviews how to obtain said posterior. Next, we review how to obtain realizations of it, and the code to repeat them and obtain the minimum of each one (this is the basic of the ”Thompson sampling” heuristic in Bayesian optimization). With this code, six thousand minimums are obtained and three histograms are plotted:
Marginal of , probability density that in a given there is a global minimum of some realization;
Marginal of , probability density that a certain is the optimal value (global minimum) that achieves some realization;
“joint” histogram approximating the joint density function over representing the probability that some realization has the global minimum at and its value is precisely .
This is the implicitly available information about the location of the optimum. The goal of Bayesian optimization is to refine that information through sampling (explore) and/or to hit the optimum of my unknown underlying function as quickly as possible (exploit).
*Link to my [ whole collection] of videos in English. Link to larger [ Colección completa] in Spanish.