Visible to the public Comparison of Data Mining Tools for Significance Analysis of Process Parameters in Applications to Process Fault Diagnosis

TitleComparison of Data Mining Tools for Significance Analysis of Process Parameters in Applications to Process Fault Diagnosis
Publication TypeJournal Article
Year of Publication2014
AuthorsPerzyk, Marcin, Kochanski, Andrzej, Kozlowski, Jacek, Soroczynski, Artur, Biernacki, Robert
JournalInf. Sci.
Volume259
Pagination380–392
ISSN0020-0255
Keywordsdata mining, fault diagnosis, Input variable significance, Manufacturing industries
Abstract

This paper presents an evaluation of various methodologies used to determine relative significances of input variables in data-driven models. Significance analysis applied to manufacturing process parameters can be a useful tool in fault diagnosis for various types of manufacturing processes. It can also be applied to building models that are used in process control. The relative significances of input variables can be determined by various data mining methods, including relatively simple statistical procedures as well as more advanced machine learning systems. Several methodologies suitable for carrying out classification tasks which are characteristic of fault diagnosis were evaluated and compared from the viewpoint of their accuracy, robustness of results and applicability. Two types of testing data were used: synthetic data with assumed dependencies and real data obtained from the foundry industry. The simple statistical method based on contingency tables revealed the best overall performance, whereas advanced machine learning models, such as ANNs and SVMs, appeared to be of less value.

URLhttp://dx.doi.org/10.1016/j.ins.2013.10.019
DOI10.1016/j.ins.2013.10.019
Citation KeyPerzyk:2014:CDM:2564929.2564988