Skip to content

Commit 3d361f2

Browse files
hftsoipre-commit-ci[bot]MilesCranmer
authored
docs: add symbolfit paper (#750)
* add symbolfit paper * [pre-commit.ci] auto fixes from pre-commit.com hooks for more information, see https://pre-commit.ci * docs: move image to other repo * docs: fix yaml syntax --------- Co-authored-by: pre-commit-ci[bot] <66853113+pre-commit-ci[bot]@users.noreply.github.com> Co-authored-by: Miles Cranmer <[email protected]>
1 parent e8bbc5c commit 3d361f2

File tree

1 file changed

+24
-0
lines changed

1 file changed

+24
-0
lines changed

docs/papers.yml

Lines changed: 24 additions & 0 deletions
Original file line numberDiff line numberDiff line change
@@ -245,6 +245,30 @@ papers:
245245
abstract: "How can we find interpretable, domain-appropriate models of natural phenomena given some complex, raw data such as images? Can we use such models to derive scientific insight from the data? In this paper, we propose some methods for achieving this. In particular, we implement disentangled representation learning, sparse deep neural network training and symbolic regression, and assess their usefulness in forming interpretable models of complex image data. We demonstrate their relevance to the field of bioimaging using a well-studied test problem of classifying cell states in microscopy data. We find that such methods can produce highly parsimonious models that achieve ~98% of the accuracy of black-box benchmark models, with a tiny fraction of the complexity. We explore the utility of such interpretable models in producing scientific explanations of the underlying biological phenomenon."
246246
image: https://raw.githubusercontent.com/MilesCranmer/PySR_Docs/master/images/cell_state_classification.jpg
247247
date: 2024-02-05
248+
- title: "SymbolFit: Automatic Parametric Modeling with Symbolic Regression"
249+
authors:
250+
- Ho Fung Tsoi (1)
251+
- Dylan Rankin (1)
252+
- Cecile Caillol (2)
253+
- Miles Cranmer (3)
254+
- Sridhara Dasu (4)
255+
- Javier Duarte (5)
256+
- Philip Harris (6, 7)
257+
- Elliot Lipeles (1)
258+
- Vladimir Loncar (6, 8)
259+
affiliations:
260+
1: University of Pennsylvania
261+
2: European Organization for Nuclear Research (CERN)
262+
3: University of Cambridge
263+
4: University of Wisconsin-Madison
264+
5: University of California San Diego
265+
6: Massachusetts Institute of Technology
266+
7: Institute for Artificial Intelligence and Fundamental Interactions
267+
8: Institute of Physics Belgrade
268+
link: https://arxiv.org/abs/2411.09851
269+
abstract: "We introduce SymbolFit, a framework that automates parametric modeling by using symbolic regression to perform a machine-search for functions that fit the data, while simultaneously providing uncertainty estimates in a single run. Traditionally, constructing a parametric model to accurately describe binned data has been a manual and iterative process, requiring an adequate functional form to be determined before the fit can be performed. The main challenge arises when the appropriate functional forms cannot be derived from first principles, especially when there is no underlying true closed-form function for the distribution. In this work, we address this problem by utilizing symbolic regression, a machine learning technique that explores a vast space of candidate functions without needing a predefined functional form, treating the functional form itself as a trainable parameter. Our approach is demonstrated in data analysis applications in high-energy physics experiments at the CERN Large Hadron Collider (LHC). We demonstrate its effectiveness and efficiency using five real proton-proton collision datasets from new physics searches at the LHC, namely the background modeling in resonance searches for high-mass dijet, trijet, paired-dijet, diphoton, and dimuon events. We also validate the framework using several toy datasets with one and more variables."
270+
image: https://raw.githubusercontent.com/MilesCranmer/PySR_Docs/refs/heads/master/images/symbolfit_sampling.png
271+
date: 2024-11-15
248272
- title: "The automated discovery of kinetic rate models – methodological frameworks"
249273
authors:
250274
- Miguel Ángel de Carvalho Servia (1)

0 commit comments

Comments
 (0)