Efficient Data Integration Strategies for Heterogeneous Big Data Sources in Cloud Environments

Authors

  • Aqil Hafizi Universiti Teknikal Darul Naim, Department of Computer Systems and Networks, Jalan Teknologi 8-3, Kota Bharu, Malaysia Author
  • Zainuddin Bin Yusof Research Assistant at Malaysia University of Science and Technology Author

Abstract

The rapid proliferation of big data across diverse domains has transformed the operational landscape of modern organizations, necessitating robust data integration strategies capable of handling large-scale, heterogeneous datasets in cloud environments. Traditional integration methods often struggle to efficiently reconcile disparate data sources, leading to issues of redundancy, inconsistency, and performance bottlenecks. In response, this paper investigates advanced solutions that leverage distributed computing infrastructures, parallel data processing frameworks, and dynamic resource allocation to unify structured, semi-structured, and unstructured data streams with minimal latency. By presenting novel theoretical paradigms rooted in rigorous mathematical models, this work emphasizes the importance of scalable architectures for orchestrating data ingestion, transformation, and fusion in real time. In addition, this research explores a flexible optimization approach that continuously balances computational load across diverse cloud infrastructures, thereby mitigating the risk of system overload and improving overall query throughput. Through detailed algorithmic analysis and experimental evaluations, the proposed strategies are assessed under a variety of deployment scenarios and workload intensities, revealing potential limitations that guide further development. These findings underscore the complexity of designing universal, high-performance data integration frameworks and highlight the promising directions for scalable, resilient infrastructures that can adapt to the evolving demands of large-scale analytics. Ultimately, this paper aims to provide a foundational blueprint for harnessing heterogeneous big data sources within a cloud-based ecosystem.

Downloads

Published

2024-09-04

How to Cite

Efficient Data Integration Strategies for Heterogeneous Big Data Sources in Cloud Environments. (2024). International Journal of Data Science, Big Data Analytics, and Predictive Modeling, 14(9), 1-15. https://kernpublic.com/index.php/IJDSBDAPM/article/view/2024-SEP-04