MATH 401
Welcome
Hello! This website contains the work I completed during the course of Math-401, my senior project and final course to complete my BS in Applied Mathematics. I completed this course during Spring 2025; Dr. Grady Wright (Professor, Boise State Department of Mathematics), served as my project advisor. During the course, I focused on the topic of Gaussian Processes (GPs), a generalization of the multivariate Gaussian distribution, and how GPs can be useful in the context of various regression problems.
You can browse my writings on different subtopics from the sidebar, or by searching using the magnifying glass in the upper-right corner of the page. This website was built using Quarto, with data analysis, inference, and visualization being conducted using the R and Stan programming languages. My poster’s layout was built from an excellent typst template, with a few formatting tweaks.
Project Summary
We began the semester by examining the connections between Gaussian Processes and Kernal Ridge Regression (KRR), reading Kanagawa et al. (2018) as a starting point. The two approaches are theoretically related in that both rely on positive-definite kernels to accomplish interpolation or data fitting. In practice, the approaches are comparable; for example, the KRR estimator is equivalent to the GP posterior mean function. However, the approaches differ in their quantification of uncertainty1, and in how they define hypothesis spaces2.
Having an introduction to the topics, I worked to gain an understanding of Gaussian Process regression by simulating univariate data. From this basic case, we moved to multidimensional problems to explore how GPs can be useful for regression problems with a spatial dimension. Early applications of GPR were in geostatistics, in which the technique (more commonly referred to as Kriging) has been used to predict the distribution of minerals over a sampled area. We concluded by applying GPR to a vector-valued outcome, specifically velocity fields. Here I drew upon the Intrinsic Coregionalization Model, discussed in Alvarez et al. (2012), which involves constructing a similarity matrix to summarize cross-output dependencies which is then combined with the kernel/Gram matrix constructed from the researcher’s training data. The page Multioutput Gaussian Process Regression explores these topics using a simulation of Hurricane Isabel’s velocity field, and also includes my attempts to conduct hyperparameter inference using the Stan programming language. I also explored multioutput/vector-valued GPR with Particle Image Velocimetry data from Harlander, Wright, and Egbers (2012) in my project’s poster.
Weekly Meetings
These are brief notes I took to summarize topics and materials I discussed with Dr. Wright during our scheduled meetings. Some of the links that reference this site may be dead; I ultimately consolidated several pages down to the set that are available today in the sidebar.
References
Footnotes
For GPs, uncertainty is represented through the posterior variance, and can be estimated/visualized via draws from the posterior distribution. With KRR, Kanagawa et al. (2018) suggest that the posterior variance can be interpreted as a “worst case error” in a Reproducing Kernel Hilbert Space (RKHS).↩︎
For KRR, it is assumed that the target function belongs to a RKHS (or can be well approximated by function in a RKHS). However, the support for GPs is not identical to that of a RKHS (Kanagawa et al. 2018, 3).↩︎