Skip to content

Evolution of Data Management Systems🔗

Fundamental Concepts, Methods and Applications🔗

In this three part class, students will cover the history of data management systems, from file systems to databases to distributed cloud storage. This class is given over the length of the Data Engineering course. Questions from the first two parts are integrated into the exam on cloud computing, and questions from the Cloud DMS section are integrated into the Dask notebook evaluation.

Objectives🔗

The objectives of this course are: - Introduce the fundamental concepts - Describe, in a synthetic way, the main characteristics of the evolution of DMS (Data Management Systems) - Highlight targeted application classes.

Key Words🔗

Data Management Systems, Uni-processor DBMS, Parallel DBMS, Data Integration Systems,Big Data, Cloud Data Management Systems, High Performance, Scalability, Elasticity, Multi-store/Poly-store Systems

Targeted Skills🔗

  • Effectively exploit the DMS according to the environment (uniprocessor, parallel, distributed, cloud) in a perspective of decision support within an organization.
  • Ability to choose, in a relevant way, a DMS in multiple environments for an optimal functioning of the applications of an organization

Indicative Program🔗

  1. Introduction to Main Problems of Data Management

  2. Parallel Database Systems

  3. From Distributed DB to Data Integration Systems DIS

  4. Cloud Data Management Systems CDMS

    • Motivations and Objectives
    • Main Characteristics of Big Data and CDMS
    • Classification of Cloud Data Management Systems CDMS
    • Advantages and Weakness of Parallel RDBMS and CDMS
    • Comparison between Parallel RDBMS and CDMS
    • Introduction to Multi-store/Ploystore Systems
    • Resources:
  5. Conclusion

    • Maturity of Cloud DMS
    • Key Criteria for Choosing a Data Management System

Additional Reading🔗

  1. Principles of Distributed Database Systems, M. Tamer Ozsu and Patrick Valduriez; Springer-Verlag ; Fourth Edition, December 2019.

  2. Data Management in the Cloud: Challenges and Opportunities Divyakant Agrawal, Sudipto Das, and Amr El Abbadi; Synthesis Lectures on Data Management, December 2012, Vol. 4, No. 6 , Pages 1-138.

  3. Query Processing in Parallel Relational Database Systems; H. Lu, B.-C Ooi and K.-L. Tan; IEEE Computer Society Press, CA, USA, 1994.

  4. Traitement parallèle dans les bases de données relationnelles : concepts, méthodes et applications Abdelkader Hameurlain, Pierre Bazex, Franck Morvan; Cépaduès Editions, Octobre 1996.