Repository logo
 
Publication

Adaptive and Scalable Database Management with Machine Learning Integration: A PostgreSQL Case Study

datacite.subject.fosEngenharia e Tecnologia::Engenharia Eletrotécnica, Eletrónica e Informática
dc.contributor.authorAbbasi, Maryam
dc.contributor.authorBernardo, Marco V.
dc.contributor.authorVaz, Paulo
dc.contributor.authorSilva, José
dc.contributor.authorMartins, Pedro
dc.contributor.authorANTUNES VAZ, PAULO JOAQUIM
dc.contributor.authorSilva, José
dc.date.accessioned2025-03-25T10:57:11Z
dc.date.available2025-03-25T10:57:11Z
dc.date.issued2024-09-18
dc.description.abstractThe increasing complexity of managing modern database systems, particularly in terms of optimizing query performance for large datasets, presents significant challenges that traditional methods often fail to address. This paper proposes a comprehensive framework for integrating advanced machine learning (ML) models within the architecture of a database management system (DBMS), with a specific focus on PostgreSQL. Our approach leverages a combination of supervised and unsupervised learning techniques to predict query execution times, optimize performance, and dynamically manage workloads. Unlike existing solutions that address specific optimization tasks in isolation, our framework provides a unified platform that supports real-time model inference and automatic database configuration adjustments based on workload patterns. A key contribution of our work is the integration of ML capabilities directly into the DBMS engine, enabling seamless interaction between the ML models and the query optimization process. This integration allows for the automatic retraining of models and dynamic workload management, resulting in substantial improvements in both query response times and overall system throughput. Our evaluations using the Transaction Processing Performance Council Decision Support (TPC-DS) benchmark dataset at scale factors of 100 GB, 1 TB, and 10 TB demonstrate a reduction of up to 42% in query execution times and a 74% improvement in throughput compared with traditional approaches. Additionally, we address challenges such as potential conflicts in tuning recommendations and the performance overhead associated with ML integration, providing insights for future research directions. This study is motivated by the need for autonomous tuning mechanisms to manage large-scale, hetero geneous workloads while answering key research questions, such as the following: (1) How can machine learning models be integrated into a DBMS to improve query optimization and workload management? (2) What performance improvements can be achieved through dynamic configuration tuning based on real-time workload patterns? Our results suggest that the proposed framework significantly reduces the need for manual database administration while effectively adapting to evolving workloads, offering a robust solution for modern large-scale data environments.eng
dc.identifier.citationAbbasi, M., Bernardo, M. V., Váz, P., Silva, J., & Martins, P. (2024). Adaptive and Scalable Database Management with Machine Learning Integration: A PostgreSQL Case Study. Information, 15(9), 574. https://doi.org/10.3390/info15090574
dc.identifier.doihttps://doi.org/10.3390/info15090574
dc.identifier.eissn2078-2489
dc.identifier.urihttp://hdl.handle.net/10400.19/9301
dc.language.isoeng
dc.peerreviewedyes
dc.publisherMDPI
dc.relation.hasversionhttps://www.mdpi.com/2078-2489/15/9/574
dc.rights.urihttp://creativecommons.org/licenses/by/4.0/
dc.subjectmachine learning integration
dc.subjectdatabase optimization
dc.subjectquery performance
dc.subjectdynamic workload management
dc.subjectPostgreSQL
dc.subjectreal-time system tuning
dc.titleAdaptive and Scalable Database Management with Machine Learning Integration: A PostgreSQL Case Studyeng
dc.typetext
dspace.entity.typePublication
oaire.citation.issue9
oaire.citation.startPage574
oaire.citation.titleInformation
oaire.citation.volume15
oaire.versionhttp://purl.org/coar/version/c_970fb48d4fbd8a85
person.familyNameANTUNES VAZ
person.familyNameSilva
person.givenNamePAULO JOAQUIM
person.givenNameJosé
person.identifier.ciencia-id351C-9899-0EE7
person.identifier.ciencia-id4A14-D3E7-5B32
person.identifier.orcid0000-0002-1745-8937
person.identifier.orcid0000-0001-7285-8282
person.identifier.scopus-author-id55447844100
relation.isAuthorOfPublication702e79ee-5b0b-47ff-989d-12e6d8ea1e89
relation.isAuthorOfPublicatione9d8719e-af47-4008-b854-817801bb3964
relation.isAuthorOfPublication.latestForDiscovery702e79ee-5b0b-47ff-989d-12e6d8ea1e89

Files

Original bundle
Now showing 1 - 1 of 1
Loading...
Thumbnail Image
Name:
information-15-00574 (2).pdf
Size:
4.1 MB
Format:
Adobe Portable Document Format
License bundle
Now showing 1 - 1 of 1
No Thumbnail Available
Name:
license.txt
Size:
1.79 KB
Format:
Item-specific license agreed upon to submission
Description: