Skip to main content

Ctrl+K

Learning from data for physicists

Overview

1. Invitation to inductive inference
2. Introduction

Part I: Bayesian methods for scientific modeling

3. Overview of Part I
4. Inference and PDFs
5. More on PDFs
6. Updating via Bayes' rule
7. Error propagation
8. Bayes in practice
9. Exercises and problems for Part I

Part II: Advanced Bayesian methods

10. Overview of Part II
11. Assigning probabilities
12. Dealing with outliers
13. Bayes goes linear — History matching
- 13.1. Bayes linear methods
- 13.2. Iterative history matching
14. Model selection
15. Discrepancy Models with GPs
16. Model averaging and mixing

Part III: MCMC sampling

17. Overview of Part III
18. Intuition for MCMC
19. Details of MCMC
20. MCMC in practice
21. Advanced sampling algorithms
22. State-of-the-art sampling implementations

Part IV: Machine learning basics

23. Overview of Part IV
24. Machine learning overview
25. Logistic Regression
- 25.5. Machine Learning: First Examples
- 25.6. Exercise: Logistic Regression and Neural Networks
26. Artificial neural networks (ANNs)
27. *Convolutional neural nets
28. Problems for Part IV

Part V: Probabilistic machine learning

29. Overview of Part V
30. Bayesian neural nets
- 30.4. Demo: Variational Inference and Bayesian Neural Networks
31. Gaussian processes
32. ANNs in the large-width limit (ANNFT)
33. Bayesian Optimization
34. Dimensionality reduction and emulators
35. Problems for Part V
- 35.1. Bayesian neural networks

Backmatter

36. Bibliography
37. Guide to Jupyter Book markdown

Appendix A: Statistics

38. Notation and overview of statistics material
39. The probability measure
40. Working with probability distributions

Appendix B: Scientific modeling

41. Overview of scientific modeling material
42. Overview of modeling
43. Linear models
44. Mathematical optimization

Appendix C: Getting started

45. Overview of Getting started material
46. Setting up for interactive use of this Jupyter book
- 46.1. Using git for cloning the book repository
- 46.2. Setting up your Python enviroment
47. Jupyter notebooks and Python
48. Guides on Jupyter notebooks and Python

TALENT mini-projects

Overview of mini-projects
MP I: Parameter estimation for a toy model of an EFT
MP IIa: Model selection basics
MP IIb: How many lines?
- Mini-project IIb: How many lines are there?
MP IIIa: Bayesian optimization
MP IIIb: Bayesian Neural Networks

Repository
Open issue

.md

Critical tuning of hyperparameters

Contents

Mean absolute error loss vs. epochs

32.7. Critical tuning of hyperparameters#

../../../_images/ANNFT_critically_tuning_hyperparameters.png

../../../_images/ANNFT_criticality_nuclear.png

Mean absolute error loss vs. epochs#

../../../_images/ANNFT_ReLU_MAE_loss.png — Fig. 32.15 The mean absolute error loss vs. epochs for ReLU activation functions. Hidden layer widths are at 100 neurons, and depths are 1,2,4, and 8 hidden layers. The CICT and CINT architectures outperform the NINT, and their performance improves with depth, whereas the NINT networks stagnate in performance after 2 hidden layers.#

../../../_images/ANNFT_Tanh_MAE_loss.png — Fig. 32.16 Same as Fig. 32.15 but for Tanh activation functions.#

previous

32.6. Expansion parameter

next

32.8. Summary of results

Contents

Mean absolute error loss vs. epochs

By Christian Forssén, Dick Furnstahl, and Daniel Phillips

© Copyright 2026.