ALMEIDA, Í. C.; http://lattes.cnpq.br/6900215640989438; ALMEIDA, Ícaro Chagas de.
Resumen:
The Department of Informatics of the Unified Health System (DATASUS) provides essential data for analyzing the health situation in Brazil, supporting decision-making and the development of health intervention programs. However, this data is provided in a format that is not directly compatible with popular tools such as Excel or Google Sheets, making it less accessible to researchers, analysts, and healthcare professionals. Additionally, there is the challenge associated with the significant volume of data, characterized as Big Data. The massive scale of this data demands efficient data engineering approaches for processing and storage to provide a robust and scalable system for handling it. In light of this problem, this work proposes the development of an automated process for extracting 47 tables available in dbc format files from 13 DATASUS databases, storing them in a data warehouse, and making the data available through an API. This initiative is expected to facilitate health data analysis in Brazil by providing simplified access and ready-to-analyze data to researchers, analysts, and healthcare professionals.