 Open Access
				Open Access 
				 Subscription Access
									Subscription Access
							Secure Semantic Databases: Integrating Knowledge Management and Encrypted Machine Learning for Intelligent Data Systems
Abstract
The combination of mass data management and artificial intelligence has resulted in the desperate need of intelligent data system to be semantically, and highly-secure. Traditional databases are very good at storing systematized data but per se they lack enlightenment of convoluted connections and importance (semantics). Instead, Knowledge Management (KM) systems, such as knowledge graphs, provide such semantic layer permitting to perform strong reasoning and question. However, feeding machine learning (ML) using this semantic data would be tend to reveal sensitive information, which can be a severe privacy and security hazard.
The paper proposes a novel model of Secure Semantic Databases (S2DB), which would integrate ideas of Knowledge Management and the privacy-awareness of Encrypted Machine Learning (EML). The S2DB system is a structure that stores and processes data on a semantic knowledge graph (data structure) to make it (intelligently) queryable and allow relation finding. Most importantly, they enable the training and inference of ML models based on this encrypted semantic information using methods like Homomorphic or Secure Multi-Party Computation (S). This is to make certain that the sensitive data contained within the knowledge base remains confidential throughout the lifecycle of ML, even by the owners of the database themselves.
The outputs of this work are an architectural version of S2DB, a prototype implementation of the work that demonstrates its viability, and a performance test on benchmark applications. The gap between utility and security of data and open the door to a new generation of smart data system will rush to fill the gap in the potential for the implementation of safety in the privacy-sensitive setting of healthcare, finance and confidential business intelligence.
References
Antoniou, G., & van Harmelen, F. (2008). A Semantic Web Primer (2nd ed.). The MIT Press.
Auer, S., Bizer, C., Kobilarov, G., Lehmann, J., Cyganiak, R., & Ives, Z. (2007). DBpedia: A Nucleus for a Web of Open Data. In The Semantic Web (pp. 722-735). Springer Berlin Heidelberg.
Dwork, C., McSherry, F., Nissim, K., & Smith, A. (2006). Calibrating Noise to Sensitivity in Private Data Analysis. In Theory of Cryptography Conference (pp. 265-284). Springer Berlin Heidelberg.
Gentry, C. (2009). A fully homomorphic encryption scheme (Doctoral dissertation, Stanford University).
Juvekar, C., Vaikuntanathan, V., & Chandrakasan, A. (2018). {GAZELLE}: A Low Latency Framework for Secure Neural Network Inference. In 27th USENIX Security Symposium (USENIX Security 18) (pp. 1651-1669).
Konečný, J., McMahan, H. B., Yu, F. X., Richtárik, P., Suresh, A. T., & Bacon, D. (2016). Federated Learning: Strategies for Improving Communication Efficiency. arXiv preprint arXiv:1610.05492.
Lassila, O., & Swick, R. R. (1999). Resource Description Framework (RDF) Model and Syntax Specification. W3C Recommendation. World Wide Web Consortium.
McGuinness, D. L., & van Harmelen, F. (2004). OWL Web Ontology Language Overview. W3C Recommendation. World Wide Web Consortium.
Popa, R. A., Redfield, C. M., Zeldovich, N., & Balakrishnan, H. (2011). CryptDB: Protecting Confidentiality with Encrypted Query Processing. In Proceedings of the Twenty-Third ACM Symposium on Operating Systems Principles (pp. 85-100).
Ryffel, T., Trask, A., Dahl, M., Wagner, B., Mancuso, J., Rueckert, D., & Passerat-Palmbach, J. (2018). A generic framework for privacy preserving deep learning. arXiv preprint arXiv:1811.04017.
Refbacks
- There are currently no refbacks.