DataBay: A Unified Platform for Automating Data Warehouse Management, Real-Time Data Processing, and Ensuring Data Quality and Monitoring
(ندگان)پدیدآور
Ghadimi, MostafaBaghayi, NiyushaShateri, Alireza
نوع مدرک
TextResearch Paper
زبان مدرک
Englishچکیده
As organizations increasingly depend on large-scale data for strategic decision-making, managing data warehouses has become a complex and resource-intensive challenge. This paper introduces DataBay, a unified platform designed to automate the entire data warehouse lifecycle, from data ingestion and transformation to real-time processing, monitoring, and ensuring data quality. DataBay leverages Avro for data serialization, providing optimal throughput and storage efficiency. Additionally, its automated data pipeline orchestration, along with built-in data quality checks, enhances the reliability and accuracy of insights derived from the data. The platform's architecture is highly scalable, supporting enterprise-level datasets and adapting to evolving business needs. Through its seamless integration and flexibility, DataBay helps businesses make timely, data-driven decisions and enables continuous optimization of data workflows. This paper discusses the platform's architecture, its implementation in real-world industry settings, and the significant business value it delivers by enhancing operational efficiency and empowering data-driven decision-making across organizations.
کلید واژگان
Data WarehouseData Engineering
Real-time Data Processing
Data quality
data management
Business Intelligence
Data transformation
Data Monitoring
Scalable Data Platform
شماره نشریه
2تاریخ نشر
2024-12-011403-09-11
ناشر
University of Tehranسازمان پدید آورنده
ut.ac.irdataburst.tech
databurst.tech
شاپا
2476-27762476-2784



