Research Article

Enhancing Life Sciences Master Data Governance with AI-Driven Data Protection and Masking

by  Vinod Thallapally
journal cover
International Journal of Computer Applications
Foundation of Computer Science (FCS), NY, USA
Volume 187 - Issue 42
Published: September 2025
Authors: Vinod Thallapally
10.5120/ijca2025925732
PDF

Vinod Thallapally . Enhancing Life Sciences Master Data Governance with AI-Driven Data Protection and Masking. International Journal of Computer Applications. 187, 42 (September 2025), 32-38. DOI=10.5120/ijca2025925732

                        @article{ 10.5120/ijca2025925732,
                        author  = { Vinod Thallapally },
                        title   = { Enhancing Life Sciences Master Data Governance with AI-Driven Data Protection and Masking },
                        journal = { International Journal of Computer Applications },
                        year    = { 2025 },
                        volume  = { 187 },
                        number  = { 42 },
                        pages   = { 32-38 },
                        doi     = { 10.5120/ijca2025925732 },
                        publisher = { Foundation of Computer Science (FCS), NY, USA }
                        }
                        %0 Journal Article
                        %D 2025
                        %A Vinod Thallapally
                        %T Enhancing Life Sciences Master Data Governance with AI-Driven Data Protection and Masking%T 
                        %J International Journal of Computer Applications
                        %V 187
                        %N 42
                        %P 32-38
                        %R 10.5120/ijca2025925732
                        %I Foundation of Computer Science (FCS), NY, USA
Abstract

The life sciences industry handles extremely sensitive master data—patient IDs, proprietary product specs, clinical trial records, and supplier compliance files. Compliance regimes such as HIPAA, GDPR, and FDA 21 CFR Part 11 require strict regimes for data access, masking, and protection. Classic Master Data Governance setups in solutions like SAP MDG deliver accuracy and consistency, yet their rule-based, static approaches to data protection do not keep pace with shifting privacy risks. This paper presents an AI-Enabled Data Protection and Masking Framework designed to sit within life sciences MDG workflows. The framework combines machine learning–based sensitive data classification, context-aware masking, and dynamic real-time access control into the MDG process. Leveraging natural language processing and pattern recognition, the solution autonomously detects sensitive columns—such as patient IDs, trial site information, and controlled-substance data—then applies masking, tokenization, or encryption based on user role, geographical location, and applicable regulatory jurisdiction. In a controlled simulation, we tested the framework and saw a drop in potential data exposure risks of more than 30%, a boost in readiness for compliance audits, and a simplification of the approval process. Results indicated that embedding AI into the Master Data Governance layer strengthens both privacy and security, yet keeps the data fit for analytics and operational choices. Life sciences companies thus gain the ability to meet regulatory demands without stifling innovation.

References
  • Khatri, V. and Brown, C. V. 2010. Designing data governance. Communications of the ACM, 53(1), 148–152.
  • Otto, B. 2011. A morphology of the organization of data governance. ECIS 2011 Proceedings. Paper 214.
  • Loshin, D. 2013. Master Data Management. Morgan Kaufmann Publishers.
  • Friedman, T. and Smith, M. 2011. The data governance imperative. Information Management, 45(4), 10–12.
  • Heudecker, N., and Beyer, M. A. 2014. Market guide for data masking. Gartner Research, G00260736.
  • Inmon, W. H., and Linstedt, D. 2014. Data Architecture: A Primer for the Data Scientist. Morgan Kaufmann Publishers.
  • Sweeney, L. 2002. k-anonymity: A model for protecting privacy. International Journal on Uncertainty, Fuzziness and Knowledge-Based Systems, 10(5), 557–570.
  • Raghupathi, W., and Raghupathi, V. 2014. Big data analytics in healthcare: Promise and potential. Health Information Science and Systems, 2(1), 3.
  • ISO/IEC 20889:2018. Privacy enhancing data de-identification terminology and classification of techniques. International Organization for Standardization, Geneva, Switzerland.
  • GDPR. 2016. Regulation (EU) 2016/679 of the European Parliament and of the Council of 27 April 2016. Official Journal of the European Union, L119, 1–88.
Index Terms
Computer Science
Information Sciences
No index terms available.
Keywords

Artificial Intelligence (AI) Master Data Governance (MDG) Data Masking Life sciences Data Protection GDPR HIPAA FDA 21 CFR Part 11 SAP MDG Real-time Access Control (RBAC) Sensitive Data Classification

Powered by PhDFocusTM