News Release 6-Sep-2023

Fine-structure sensitive deep learning framework for prediction of catalytic properties with high precision

Peer-Reviewed Publication

Dalian Institute of Chemical Physics, Chinese Academy Sciences

Figure Abstract — image: A HIGH-PRECISION ML FRAMEWORK NAMED GLCNN THAT COMBINES "GLOBAL + LOCAL" FEATURES CAN CAPTURE ORIGINAL FINE STRUCTURES OF CATALYTIC SURFACES AND EXTRACT THE KEY FACTORS AFFECTING CATALYTIC PERFORMANCE FROM BOTH GEOMETRIC AND CHEMICAL/ELECTRONIC FEATURES. view more

Credit: Chinese Journal of Catalysis

Catalysis is inevitable in modern society. The fine structure of the catalytic surface has a significant impact on structural sensitive reactions. High-throughput (HT) screening and machine learning (ML) are believed to effectively explore the potential rules of these effects and accelerate the developing of the catalyst. However, reported ML frameworks are too coarse to make a precise prediction of the catalytic performance.

Currently, the two commonly used conversion methods are descriptors and graphs. However, the construction of descriptors often ignores atomic connections, making it difficult for ML models to capture detailed geometrical information most relevant to catalytic performance. The graph-based ML model inevitably loses the geometric arrangement information of adsorption sites during the process of updating nodes, and the complexity of the message passing neural network leads to its insensitivity to electronic or geometric structures and poor interpretability. Therefore, there is still a lack of interpretable ML frameworks that can simultaneously capture the features of electronic and geometric fine structures in heterogeneous catalysis.

Recently, a research team led by Prof. Yong Wang from Zhejiang University, China, created a data augmented convolutional neural network (CNN) ML framework called GLCNN, which combines "global + local" features. This framework can capture the original fine structures without complicated encoding methods by transforming catalytic surfaces and adsorption sites into two-dimensional grids and one-dimensional descriptors, respectively. The addition of data augmentation (DA) can expand the dataset and alleviate overfitting caused by insufficiency of chemical datasets. The GLCNN framework accurately predicted and distinguished the adsorption energies of OH on a set of analogous carbon-based transition metal single-atom catalysts (TMSACs) with a mean absolute error (MAE) of less than 0.1 eV, ranking the best result of popular models trained on large datasets so far. The results were published in Chinese Journal of Catalysis (DOI: 10.1016/S1872-2067(23)64467-5).

Comparing GLCNN with descriptor or graph-based models, it was found that the comparison model cannot accurately predict the OH adsorption energy of catalysts containing IB and IIB transition metals or cis/trans configurations. The prediction performance of the GLCNN model is significantly better than that of the comparison model, indicating that the combination of grids and descriptors can better reflect the electronic and fine geometrical information of catalytic active centers.

Unlike conventional CNN and descriptor-based ones with one-sided feature extraction, this fine-structure sensitive ML framework can extract the key factors that affect catalytic performance from both geometric and chemical/electronic features, such as symmetry and coordination elements, through unbiased interpretable analysis. The analysis of feature importance for descriptors part indicates that the electronic structure and symmetry element of adsorption sites are crucial, and the importance of metals is stronger than their coordination environment. Visualization analysis on each layer indicates that GLCNN can automatically extract geometrical information of chemical structures that conform to human intuition. As the layers deepen, GLCNN gradually seeks the direction of feature extraction based on basic catalytic knowledge, extracting more abstract high-dimensional features that are conducive to adsorption energy prediction. This framework provides a feasible solution for high-precision HT screening of heterogeneous catalyst with a broad physical and chemical space.

###

About the Journal

Chinese Journal of Catalysis is co-sponsored by Dalian Institute of Chemical Physics, Chinese Academy of Sciences and Chinese Chemical Society, and it is currently published by Elsevier group. This monthly journal publishes in English timely contributions of original and rigorously reviewed manuscripts covering all areas of catalysis. The journal publishes Reviews, Accounts, Communications, Articles, Highlights, Perspectives, and Viewpoints of highly scientific values that help understanding and defining of new concepts in both fundamental issues and practical applications of catalysis. Chinese Journal of Catalysis ranks at the top one journal in Applied Chemistry with a current SCI impact factor of 16.5. The Editors-in-Chief are Profs. Can Li and Tao Zhang.

At Elsevier http://www.journals.elsevier.com/chinese-journal-of-catalysis

Manuscript submission https://mc03.manuscriptcentral.com/cjcatal

Disclaimer: AAAS and EurekAlert! are not responsible for the accuracy of news releases posted to EurekAlert! by contributing institutions or for the use of any information through the EurekAlert system.