Machine Learning Applied to the Clerical Task Management Problem in Master Data Management Systems
Loading...
Fulltext URI
Document type
Files
Additional Information
Date
relationships.isAuthorOf
Journal Title
Journal ISSN
Volume Title
Source
Publisher
Gesellschaft für Informatik, Bonn
Abstract
Clerical tasks are created if a duplicate detection algorithm detects some similarity of records but not enough to allow an auto-merge operation. Data stewards review clerical tasks and make a final non-match or match decision. In this paper we evaluate different machine learning algorithms regarding their accuracy to predict the correct action for a clerical task and execute that action automatically if the prediction has sufficient confidence. This approach reduces the amount of work for data stewards by factors of magnitude.
Description
Keywords
IBM Master Data Management, MDM, Machine Learning, Random Forest, XGBoosting, Sorted Neighborhood Method, Data Fusion, Matching, Clerical Task Processing, Duplicate Detection
