A great discussion of the reality of data lakes, data warehouses and how the future is distributed.

I’m picking some points from the discussion, but I have really liked what is said here.  The participants approach the topics with great practicality devoid of the overly rosy sales discussions.

  • The data lake is not a full data integration, it’s just one level of refinement.
  • The data warehouse is not the center anymore, the future is distributed. The amount of data outside the data warehouse is bigger than what’s stored inside.
  • Most companies have accidental data architectures. Now they need to remodel their data architectures looking for synergies between different systems.
  • The data warehouse, the data lakes, and analytics; all of it needs DataOps.

https://www.eckerson.com/articles/daniel-graham-data-lakes-vs-data-warehouses

#DataLake #DataWarehouse #Analytics #BigData

Leave a comment