Call for Participation: Interpretable Symbolic Regression for Data Science

SRBench will host its first competition at the GECCO 2022 conference in Boston, MA. Symbolic regression methods have made tremendous advances in the past decade and have recently gained renewed interest as the broader scientific community has recognized the importance of interpretable machine learning. Despite this, there is little agreement in the field about which algorithms are “state-of-the-art” or how best to design symbolic regression methods for use in the real world. This competition seeks to distill algorithmic design choices and improve the practice of symbolic regression by evaluating the submitted methods on previously unseen, real-world and synthetic datasets. These datasets will be sourced mainly from the domains of physics, epidemiology, and bioinformatics.

Participants are asked to adapt and submit their symbolic regression algorithms to SRBench following the Competition Guide. SRBench will automatically test these methods for conformance with the competition requirements.

After the submission deadline, methods will be tested on previously unseen datasets. These datasets cover synthetic and real-world problems, and in each case either an exact model or a human-designed model will be used for comparison. Note that these benchmarks will be newly curated specifically for this competition. The current version of SRBench will serve as a first-pass filter for candidate entries, so participants are free to test and fine-tune their algorithms on it. Algorithm submissions will be judged by their ability to discover the ground-truth models or, in the case of real-world data, to approximate or outperform the expert models at similar or lower complexity. Winners will be determined by the accuracy and simplicity of the generated models, both individually and in the Pareto sense. After the competition, the submitted methods, evaluation procedure, and new datasets will be made publicly available.

See the Competition Guide for detailed instructions.

Important Dates

Entrants should have their methods submitted by May 15, 2022 (deadline extended from May 1, 2022).

Submissions will be accepted starting March 14, 2022. Note that submissions are tested automatically and must pass all tests to qualify as competition entries, so please budget time for this. The winners will be announced at GECCO, July 9-13, in Boston, MA, and online.

Previous History

This is a new competition, but it builds on the recent work of several symbolic regression researchers to create a comprehensive benchmark framework called SRBench [1]. SRBench has already benchmarked 21 methods on 252 regression problems, including real-world and synthetic problems from several domains. In addition, the repository includes a continuous integration environment that handles new submissions. We will leverage this aspect of SRBench to gather submissions for the competition and automatically test them for conformance with the competition analysis pipeline.

Judging

Exact ranking criteria will be released soon. Roughly, judging will be done in three stages:

Stage 1: Submitted methods will be benchmarked on datasets from PMLB as a filter for the subsequent stages. Methods will be required to perform at least as well as penalized linear regression across this benchmark in terms of test-set R²; a rough sketch of this comparison is given after the stage list.

Stage 2: Synthetic: Stage 2 will consist of comparisons on synthetic benchmarks. Metrics for evaluation will include model complexity (TBD), accuracy (R²), and solution rate (the fraction of problem instances for which a method generates an exact solution).

Stage 3: Real-world: Methods will be compared on a real-world prediction task and reviewed by domain experts.
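For concreteness, the sketch below illustrates the kind of Stage 1 baseline comparison described above. The exact train/test split, baseline model, and pass criterion are set by the organizers; the `passes_stage1` helper, the `ElasticNetCV` baseline, and the 75/25 split here are illustrative assumptions only.

```python
# Rough, hypothetical sketch of the Stage 1 filter: does a submitted estimator match
# or beat a penalized linear regression baseline on test-set R^2? The helper name,
# baseline choice, and split are assumptions, not the official evaluation code.
from sklearn.linear_model import ElasticNetCV
from sklearn.metrics import r2_score
from sklearn.model_selection import train_test_split


def passes_stage1(est, X, y, random_state=0):
    """Return True if `est` performs at least as well as penalized linear regression."""
    X_train, X_test, y_train, y_test = train_test_split(
        X, y, test_size=0.25, random_state=random_state
    )
    baseline = ElasticNetCV().fit(X_train, y_train)  # penalized linear regression
    est.fit(X_train, y_train)
    return r2_score(y_test, est.predict(X_test)) >= r2_score(
        y_test, baseline.predict(X_test)
    )
```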

Prizes

  1. A total of $2500 in cash prizes will be awarded to winning entries of stage 2 and stage 3 (exact amounts/division TBD).
  2. Winners will be invited to speak at the Symbolic Regression Workshop during the conference.
  3. Additional publication opportunities will be released as they become available.

Who can Participate

Anyone! While SRBench is completely open-source, we will accept both open-source and closed-source entries to this competition. Note that methods cannot rely on external API calls: they must be completely self-contained.

How to Participate

Detailed instructions are in the Competition Guide. We will provide updates to the contributing guide as details solidify. We also plan to release a tutorial video demonstrating how to enter the competition; stay tuned.

Participants must provide files for their method via a Pull Request to the Competition2022 branch on SRBench. In short, a submission consists of a scikit-learn-compatible SR method, an installation script for the method (see examples in the repository), and a Python script that sets the required variables. After submission, a CI process will automatically test the entry and return an error message if there is any problem. For assistance, open an issue in the srbench repository. Methods must pass CI and code review before the competition submission deadline to be considered.
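As a rough illustration of what a scikit-learn-compatible submission might look like, the sketch below shows a minimal regressor with `fit`/`predict` and a `model()` accessor for the final symbolic expression. The exact attributes, file layout, and variable names SRBench expects are specified in the Competition Guide; everything named here (`MySymbolicRegressor`, `est`, `model`) is an illustrative assumption.

```python
# Hypothetical sketch of a submission's estimator file. Class, attribute, and
# function names are placeholders; consult the Competition Guide for the exact
# interface SRBench requires.
import numpy as np
from sklearn.base import BaseEstimator, RegressorMixin


class MySymbolicRegressor(BaseEstimator, RegressorMixin):
    def __init__(self, max_generations=100):
        self.max_generations = max_generations

    def fit(self, X, y):
        # ... run the symbolic regression search and store the best expression ...
        self.expression_ = "x_0"  # placeholder for the discovered model
        return self

    def predict(self, X):
        # ... evaluate the stored expression on X ...
        return np.asarray(X)[:, 0]  # placeholder prediction


# Instance and model-string accessor that the submission script would expose.
est = MySymbolicRegressor()


def model(est):
    """Return a string form of the final model, e.g. for complexity analysis."""
    return est.expression_
```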

Dissemination

Participants retain copyright of their entries. The results of the competition, including comparison code and full results, will be made available through SRBench. A summary of the competition will be published, and participants will be invited to co-author.

Organizers

Please address questions to william dot lacava at childrens dot harvard dot edu.

  • Michael Kommenda
    • University of Applied Sciences Upper Austria
  • William La Cava
    • Boston Children’s Hospital and Harvard Medical School
  • Maimuna Majumder
    • Boston Children’s Hospital and Harvard Medical School
  • Fabricio Olivetti de França
    • Federal University of ABC
  • Marco Virgolin
    • Centrum Wiskunde & Informatica
  1. La Cava, W., Orzechowski, P., Burlacu, B., de França, F. O., Virgolin, M., Jin, Y., Kommenda, M., & Moore, J. H. (2021). Contemporary Symbolic Regression Methods and their Relative Performance. Proceedings of the Neural Information Processing Systems Track on Datasets and Benchmarks. arXiv