Abstract
There is growing interest in applying machine learning techniques in the field of materials science. However, the interpretation and knowledge extracted from machine learning models is a major concern, particularly as formulating an explicit model that provides insight into physics is the goal of learning. In the present study, we propose a framework that utilizes the filtering ability of feature engineering, in conjunction with symbolic regression to extract explicit, quantitative expressions for the band gap energy from materials data. We propose enhancements to genetic programming with dimensional consistency and artificial constraints to improve the search efficiency of symbolic regression. We show how two descriptors attributed to volumetric and electronic factors, from 32 possible candidates, explicitly express the band gap energy of NaCl-type compounds. Our approach provides a basis to capture underlying physical relationships between materials descriptors and target properties.
| Original language | English |
|---|---|
| Pages (from-to) | 77-83 |
| Number of pages | 7 |
| Journal | Journal of Materials Science and Technology |
| Volume | 122 |
| DOIs | |
| Publication status | Published - 20 Sept 2022 |
| Externally published | Yes |
Bibliographical note
Publisher Copyright:© 2022
Keywords
- Band gap
- Dimensional calculation
- Symbolic regression