Symbolic regression in materials science via dimension-synchronous-computation

Changxin Wang, Yan Zhang, Cheng Wen, Mingli Yang, Turab Lookman, Yanjing Su*, Tong Yi Zhang

*Corresponding author for this work

Research output: Contribution to journal > Journal Article > peer-review

28 Citations (Scopus)

Abstract

There is growing interest in applying machine learning techniques in materials science. However, interpreting machine learning models and extracting knowledge from them remain major concerns, particularly because the goal of learning is to formulate an explicit model that provides physical insight. In the present study, we propose a framework that combines the filtering ability of feature engineering with symbolic regression to extract explicit, quantitative expressions for the band gap energy from materials data. We propose enhancements to genetic programming, namely dimensional consistency and artificial constraints, to improve the search efficiency of symbolic regression. We show how two descriptors, attributed to volumetric and electronic factors and selected from 32 possible candidates, explicitly express the band gap energy of NaCl-type compounds. Our approach provides a basis for capturing the underlying physical relationships between materials descriptors and target properties.
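
The dimensional-consistency idea mentioned in the abstract can be illustrated with a minimal Python sketch. This is not the authors' implementation: the `Quantity` class, the `is_consistent` helper, the (length, mass, time) basis, the descriptor names, and the numeric values are all assumptions chosen for illustration. The sketch only shows how a candidate expression might be rejected when it adds quantities of different dimensions or fails to produce the target dimension (here, energy).

```python
from dataclasses import dataclass
from typing import Callable, Dict, Tuple

# Exponents of (length, mass, time). This basis is an assumption of the sketch.
Dim = Tuple[int, int, int]

LENGTH: Dim = (1, 0, 0)
ENERGY: Dim = (2, 1, -2)          # mass * length^2 / time^2
DIMENSIONLESS: Dim = (0, 0, 0)


class DimensionError(ValueError):
    """Raised when an operation mixes incompatible dimensions."""


@dataclass(frozen=True)
class Quantity:
    """A numeric value carried together with its dimension exponents."""
    value: float
    dim: Dim

    def _require_same_dim(self, other: "Quantity") -> None:
        if self.dim != other.dim:
            raise DimensionError(f"cannot combine {self.dim} with {other.dim}")

    def __add__(self, other: "Quantity") -> "Quantity":
        self._require_same_dim(other)
        return Quantity(self.value + other.value, self.dim)

    def __sub__(self, other: "Quantity") -> "Quantity":
        self._require_same_dim(other)
        return Quantity(self.value - other.value, self.dim)

    def __mul__(self, other: "Quantity") -> "Quantity":
        # Multiplication adds dimension exponents.
        dim = tuple(a + b for a, b in zip(self.dim, other.dim))
        return Quantity(self.value * other.value, dim)

    def __truediv__(self, other: "Quantity") -> "Quantity":
        # Division subtracts dimension exponents.
        dim = tuple(a - b for a, b in zip(self.dim, other.dim))
        return Quantity(self.value / other.value, dim)


def is_consistent(candidate: Callable[..., Quantity],
                  inputs: Dict[str, Quantity],
                  target_dim: Dim) -> bool:
    """Return True if the candidate evaluates without a dimension clash
    and its result has the target dimension."""
    try:
        result = candidate(**inputs)
    except DimensionError:
        return False
    return result.dim == target_dim


# Hypothetical descriptors with arbitrary placeholder values.
inputs = {
    "a": Quantity(5.6, LENGTH),         # a length-like descriptor
    "e": Quantity(1.2, ENERGY),         # an energy-like descriptor
    "k": Quantity(2.0, DIMENSIONLESS),  # a fitted constant
}

ok = is_consistent(lambda a, e, k: k * e + e, inputs, ENERGY)
bad_sum = is_consistent(lambda a, e, k: e + a, inputs, ENERGY)
wrong_target = is_consistent(lambda a, e, k: e / a, inputs, ENERGY)
```

In a genetic-programming loop, a check of this kind could be applied to each candidate before fitness evaluation, so that dimensionally invalid expressions are discarded early and the search space shrinks. How the paper itself integrates the check into the search is not described in this record.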

Original language: English
Pages (from-to): 77-83
Number of pages: 7
Journal: Journal of Materials Science and Technology
Volume: 122
Publication status: Published - 20 Sept 2022
Externally published: Yes

Bibliographical note

Publisher Copyright:
© 2022

Keywords

  • Band gap
  • Dimensional calculation
  • Symbolic regression
