Sarcouncil Journal of Multidisciplinary
Sarcouncil Journal of Multidisciplinary
An Open access peer reviewed international Journal
Publication Frequency- Monthly
Publisher Name-SARC Publisher
ISSN Online- 2945-3445
Country of origin- PHILIPPINES
Frequency- 3.6
Language- English
Keywords
- Social sciences, Medical sciences, Engineering, Biology
Editors

Dr Hazim Abdul-Rahman
Associate Editor
Sarcouncil Journal of Applied Sciences

Entessar Al Jbawi
Associate Editor
Sarcouncil Journal of Multidisciplinary

Rishabh Rajesh Shanbhag
Associate Editor
Sarcouncil Journal of Engineering and Computer Sciences

Dr Md. Rezowan ur Rahman
Associate Editor
Sarcouncil Journal of Biomedical Sciences

Dr Ifeoma Christy
Associate Editor
Sarcouncil Journal of Entrepreneurship And Business Management
Improving Insurance Data Quality with Machine Learning–Driven Deduplication
Keywords: Machine learning deduplication, insurance data quality, probabilistic record linkage, fuzzy matching algorithms, supervised learning models, enterprise data governance.
Abstract: Insurance enterprises confront escalating difficulties in maintaining accurate information across intricate operational landscapes where duplicate customer records generate widespread disruptions among underwriting processes, price quotation systems, claims administration, and compliance oversight functions. Conventional rule-based duplicate elimination systems exhibit poor effectiveness when handling actual business datasets filled with natural inconsistencies, spelling errors, and incomplete entries frequently found within corporate insurance information repositories. Machine learning-powered duplicate elimination platforms deliver revolutionary capabilities using advanced probability-based models merging approximate string matching technologies alongside computer-assisted learning methods able to detect complex connections overlooked by traditional exact-matching procedures. Implementation combines numerous technological elements featuring Python Dedupe software for probability-based record connection, Azure Data Factory managing distributed computing coordination, plus Streamlit dashboards supporting human-assisted verification processes. Sophisticated partitioning techniques allow manageable processing across massive corporate databases, whereas adaptive learning programs steadily enhance identification precision by systematically incorporating specialist knowledge throughout repeated improvement cycles. Performance evaluation approaches include thorough assessment structures gauging both effectiveness and operational efficiency throughout varied information complexity situations, creating uniform standards for continuous monitoring, plus regulatory adherence needs. Corporate transformation plans highlight mutually beneficial connections linking specialist knowledge with automated functions, supporting seamless movement from manual operations toward intelligent technology platforms. Expandable design permits application toward supplementary information quality difficulties across insurance functions, enabling varied computational uses featuring fraud identification, supplier information consolidation, plus client categorization programs spanning numerous coverage categories.
Author
- Sai Madhav Reddy Nalla
- Artha Data Solutions USA