Interpretable Symbolic Regression for Data Science: Analysis of the 2022 Competition

F. O. de Franca,M. Virgolin, M. Kommenda,M. S. Majumder,M. Cranmer,G. Espada,L. Ingelse,A. Fonseca,M. Landajuela,B. Petersen,R. Glatt, N. Mundhenk,C. S. Lee,J. D. Hochhalter, D. L. Randall,P. Kamienny, H. Zhang,G. Dick,A. Simon,B. Burlacu, Jaan Kasak, Meera Machado,Casper Wilstrup,W. G. La Cava

CoRR（2023）

引用 0|浏览27

暂无评分

摘要

Symbolic regression searches for analytic expressions that accurately describe studied phenomena. The main attraction of this approach is that it returns an interpretable model that can be insightful to users. Historically, the majority of algorithms for symbolic regression have been based on evolutionary algorithms. However, there has been a recent surge of new proposals that instead utilize approaches such as enumeration algorithms, mixed linear integer programming, neural networks, and Bayesian optimization. In order to assess how well these new approaches behave on a set of common challenges often faced in real-world data, we hosted a competition at the 2022 Genetic and Evolutionary Computation Conference consisting of different synthetic and real-world datasets which were blind to entrants. For the real-world track, we assessed interpretability in a realistic way by using a domain expert to judge the trustworthiness of candidate models.We present an in-depth analysis of the results obtained in this competition, discuss current challenges of symbolic regression algorithms and highlight possible improvements for future competitions.

查看译文

关键词

data science,analysis

AI 理解论文

溯源树

样例

生成溯源树，研究论文发展脉络

Chat Paper

正在生成论文摘要