Loading…

Subjective data arrangement using clustering techniques for training expert systems

•Arrangement of subjective data gathered from human experts evaluations is proposed.•Several sources of information generate different similarity measures.•A new similarity measure combining complementary similarities has been proposed.•The proposed similarity is based on unsupervised learning algor...

Full description

Saved in:
Bibliographic Details
Published in:Expert systems with applications 2019-01, Vol.115, p.1-15
Main Authors: Martín de Diego, Isaac, Siordia, Oscar S., Fernández-Isabel, Alberto, Conde, Cristina, Cabello, Enrique
Format: Article
Language:English
Subjects:
Citations: Items that this one cites
Items that cite this one
Online Access:Get full text
Tags: Add Tag
No Tags, Be the first to tag this record!
Description
Summary:•Arrangement of subjective data gathered from human experts evaluations is proposed.•Several sources of information generate different similarity measures.•A new similarity measure combining complementary similarities has been proposed.•The proposed similarity is based on unsupervised learning algorithms results.•The methodolgy has been focused on the Intelligent Transportation Systems domain. The evaluation of subjective data is a very demanding task. The classification of the information gathered from human evaluators and the possible high noise levels introduced are ones of the most difficult issues to deal with. This situation leads to adopt individuals who can be considered as experts in the specific application domain. Thus, the development of Expert Systems (ES) that consider the opinion of these individuals have been appeared to mitigate the problem. In this work an original methodology for the selection of subjective sequential data for the training of ES is presented. The system is based on the arrangement of knowledge acquired from a group of human experts. An original similarity measure between the subjective evaluations is proposed. Homogeneous groups of experts are produced using this similarity through a clustering algorithm. The methodology was applied to a practical case of the Intelligent Transportation Systems (ITS) domain for the training of ES for driving risk prediction. The results confirm the relevance of selecting homogeneous information (grouping similar opinions) when generating a ground truth (a reliable signal) for the training of ES. Further, the results show the need of considering subjective sequential data when working with phenomena where a set of rules could not be easily learned from human experts, such as risk assessment.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2018.07.058