TY - CONF
T1 - Interpreting Machine Learning Models for Geochemistry Data Classification using Decision Boundary Maps
AU - Wang, Yu
AU - Qiu, Kunfeng
AU - Telea, Alex
AU - Hou, Zhaoliang
AU - Yu, Haocheng
PY - 2023
Y1 - 2023
N2 - Machine learning has been shown to be a highly effective method for classifying geochemistry data, such as mineral forming environments and rock tectonics. However, it can be difficult to understand the decision-making processes of these models. To address this issue, we propose the use of Decision Boundary Maps (DBMs) as a visualization tool for interpreting machine learning models. These maps project high-dimensional geochemistry data onto a 2D plane and depict the decision boundaries in the projected space, providing a visual representation of the algorithms decision-making processes. In addition, DBMs can reveal trends, correlations, and outliers in the data, helping to interpret the results obtained from machine learning-based geochemistry data classification. Seeing the positions of data points, rather than just class labels, is especially valuable because samples in geological categories often follow a sequence, such as a magmatic to hydrothermal transition. Observing the positions of data points allows for the identification of trends from one class to an adjacent class.
AB - Machine learning has been shown to be a highly effective method for classifying geochemistry data, such as mineral forming environments and rock tectonics. However, it can be difficult to understand the decision-making processes of these models. To address this issue, we propose the use of Decision Boundary Maps (DBMs) as a visualization tool for interpreting machine learning models. These maps project high-dimensional geochemistry data onto a 2D plane and depict the decision boundaries in the projected space, providing a visual representation of the algorithms decision-making processes. In addition, DBMs can reveal trends, correlations, and outliers in the data, helping to interpret the results obtained from machine learning-based geochemistry data classification. Seeing the positions of data points, rather than just class labels, is especially valuable because samples in geological categories often follow a sequence, such as a magmatic to hydrothermal transition. Observing the positions of data points allows for the identification of trends from one class to an adjacent class.
U2 - 10.5194/egusphere-egu23-10228
DO - 10.5194/egusphere-egu23-10228
M3 - Abstract
ER -