A MODEL DRIVEN FRAMEWORK FOR COLLABORATIVE AND DYNAMIC DESIGN AND IMPLEMENTATION OF NOSQL-ORIENTED DATA WAREHOUSES


(Received: 10-Feb.-2024, Revised: 12-Apr.-2024 , Accepted: 27-Apr.-2024)
Nowadays, modernizing the data warehouse ecosystem is a key challenge in decision-support systems. This modernization is crucial for ensuring scalability and meeting evolving business requirements, especially with the advent of big data. A promising solution involves implementing data warehouses with contemporary data stores, such as NoSQL. In this context, we introduce in this paper a framework that leverages Model-driven Architecture (MDA) to design and implement modern data warehouses across NoSQL data stores. Our MDA approach aims to offer a collaborative, dynamic and reusable process for developing NoSQL-oriented data warehouses tailored to specific project requirements. It facilitates the automatic and dynamic generation of a hybrid data-warehouse model from its conceptual model, which encompasses structural, domain and access parameters. Moreover, our framework includes the generation of implementation code for the data warehouse, along with a set of files to validate, document and illustrate the data-warehouse schema on a target platform. Finally, we present a detailed case study to highlight the effectiveness of our MDA framework.

[1] W. H. Inmon and D. Linstedt, Data Architecture: A Primer for the Data Scientist, Elsevier Kaufman, 2014.

[2] C. Costa and M. Y. Santos, "Evaluating Several Design Patterns and Trends in Big Data Warehousing Systems," Proc. of the Int. Conf. on Advanced Information Systems Engineering, pp. 459-473, Springer, Cham, June 2018.

[3] F. Halper, "Modernizing the Organization to Support Data and Analytics," TDWI, Best Practices Report, [Online], Available: https://tdwi.org/research/2022/06/ppm-all-best-practices-report-modernizing-the-organization-support-data-analytics.aspx?tc=assetpg, 2022.

[4] D. Stodder, "Modernizing Data and Information Integration for Business Innovation," TDWI, [Online], Available: https://f.hubspotusercontent30.net/hubfs/6618383/Report%20-%20TDWI%20Best%20Practices%20-%20Q4-2021.pdf, Q4 2021.

[5] S. Chowdhury, [Online], Available: https://www.ibm.com/developerworks/analytics/library/baaugment-data-warehouse4/ba-augment-data-warehouse4-pdf.

[6] OMG, "MDA Guide Rev. 2.0," Object Management Group Model Driven Architecture (MDA), OMG Document ormsc/2014-06-01, [Online], Available: https://www.omg.org/cgi-bin/doc?ormsc/14-06-01, 2014.

[7] M. Chevalier, M. E. Malki, A. Kopliku, O. Teste and R. Tournier, "How Can We Implement a Multi-dimensional Data Warehouse Using NoSQL?," Proc. of the Int. Conf. on Enterprise Information Systems, LNBIP, vol. 241, pp. 108-130, Springer, Cham, April 2015.

[8] M. Chevalier, M. El Malki, A. Kopliku, O. Teste and R. Tournier, "Document-oriented Data Warehouses: Models and Extended Cuboids," Proc. of the 2016 IEEE 10th Int. Conf. on Research Challenges in Information Science (RCIS), DOI: 10.1109/RCIS.2016.7549351, Grenoble, France, 2016.

[9] M. Boussahoua, O. Boussaid and F. Bentayeb, "Logical Schema for Data Warehouse on Column-oriented NoSQL Databases," Proc. of the Int. Conf. on Database and Expert Systems Applications, LNISA, vol. 10439, pp. 247-256, Springer, Cham, August 2017.

[10] A. Sellami, A. Nabli and F. Gargouri, "Transformation of data warehouse schema to NoSQL graph data base," Proc. of the 18th Int. Conf. on Intelligent Systems Design and Applications (ISDA 2018), vol. 2, pp. 410-420, Vellore, India, December 6-8, 2018, Springer International Publishing, 2020.

[11] A. Vaisman, F. Besteiro and M. Valverde, "Modeling and Querying Star and Snowflake Warehouses Using Graph Databases," Proc. of New Trends in Databases and Information Systems: ADBIS 2019 Short Papers, Workshops BBIGAP, QAUCA, SemBDM, SIMPDA, M2P, MADEISD and Doctoral Consortium, Proceedings 23, pp. 144-152, Bled, Slovenia, Springer International Publishing, September 8–11, 2019.

[12] R. Benhissen, F. Bentayeb and O. Boussaid, "GAMM: Graph-based Agile Multidimensional Model," CEUR, [Online], Available: https://ceur-ws.org/Vol-3369/paper2.pdf, 2023.

[13] F. Kalna, A. Belangour, M. Banane and A. Erraissi, "MDA Transformation Process of a PIM Logical Decision-making from NoSQL Database to Big Data NoSQL PSM," Int. J. of Engineering and Advanced Technology, vol. 9, no. 1, pp. 4208-4215, 2019.

[14] D. Prakash, "NOSOLAP: Moving from Data Warehouse Requirements to NoSQL Databases," Proc. of the 14th Int. Conf. on Evaluation of Novel Approaches to Software Engineering, vol. 1: ENASE, pp. 452-458, DOI: 10.5220/0007748304520458, May 2019.

[15] R. Yangui, A. Nabli and F. Gargouri, "Automatic Transformation of Data Warehouse Schema to NoSQL Data Base: Comparative Study," Procedia Computer Science, vol. 96, pp. 255-264, 2016.

[16] L. Oukhouya, A. El Haddadi, B. Er-Raha and A. Sbai, "Automating Data Warehouse Design With MDA Approach Using NoSQL and Relational Systems," J. of Theoretical and Applied Information Technology, vol. 101, no. 23, pp. 7941-7957, 2023.

[17] A. Srai and F. Guerouate, "MDA Approach for Generating the PSM Model for the NoSQL Key-value Database, Application on Redis," Proc. of the 2023 3rd Int. Conf. on Innovative Research in Applied Science, Engineering and Technology (IRASET), pp. 1-5, Mohammedia, Morocco, 2023.

[18] F. Abdelhedi, R. Jemmali and G. Zurfluh, "Relational Databases Ingestion into a NoSQL Data Warehouse," arXiv preprint, arXiv: 2203.06949, 2022.

[19] Eclipse, "ATL Documentation," [Online], Available: https://www.eclipse.org/atl/documentation.

[20] OMG, "MDA - The Architecture of Choice for a Changing World," [Online], Available: https://www.omg.org/mda/.

[21] K. Letrache, O. El Beggar and M. Ramdani, "The Automatic Creation of OLAP Cube Using an MDA Approach," Software: Practice and Experience, vol. 47, no 12, pp. 1887-1903, 2017.

[22] W. Khan, T. Kumar, C. Zhang, K. Raj, A. M. Roy and B. Luo, "SQL and NoSQL Database Software Architecture Performance Analysis and Assessments: A Systematic Literature Review," Big Data and Cognitive Computing, vol. 7, no. 2, Atricle no. 97, DOI: 10.3390/bdcc7020097, 2023.

[23] P. J. Sadalage and M. Fowler, NoSQL Distilled: A Brief Guide to the Emerging World of Polyglot Persistence, 1st Edition, ISBN-10: 0321826620, Pearson Education, 2013.

[24] A. Meier and M. Kaufmann, SQL & NoSQL Databases, ISBN-10: 3658245484, Springer Fachmedien Wiesbaden, 2019.

[25] A. Vaisman and E. Zimányi, Data Warehouse Systems: Design and Implementation, 2nd Edition, ISBN-10: 3642546544, 2022.

[26] G. DeCandia et al., "Dynamo: Amazon’s Highly Available Key-value Store," ACM SIGOPS Operating Systems Review, vol. 41, no. 6, pp. 205-220, 2007.

[27] TPC BENCHMARK, StandardSpecification, Version 3.2.0, pp. 1-141, [Online], Available: http://tpc.org/tpc_documents_current_versions/pdf/tpc-ds_v3.2.0.pdf, June 2021.