Comprehensive Lakehouse Data Architecture Model for College Accreditation
Abstract
Accreditation is an assessment activity that determines the feasibility of study programs at a university. College accreditation data comes from various sources and includes multiple data types: semi-structured, unstructured, or structured. Over time, the volume of data will continue to grow and develop, so there is a possibility of data redundancy and a long time to collect the data needed for accreditation activities. The solution is integrating data. This research aims to design a data architecture to facilitate the management of university accreditation data using the Lakehouse data architecture model. All data types can be stored on one platform in the Lakehouse data architecture. In this research, the identification, integration, and data transformation process for university accreditation data is carried out. The data used in this research is academic data in which there are with. The study's results provide an overview of the data flow process in the Lakehouse data architecture model to help better manage university accreditation data. This architecture also supports real-time data analysis so that the accreditation process can be carried out more effectively and efficiently.
Keywords: accreditation, data analysis, data architecture, data lakehouse, data warehouse