Just a couple of weeks ago I wrote a short introduction to an article and provided a link titled
I ran across another article outlining many of the same principles. It reviews some of the trade-offs of the data lake and some of the approaches that are being developed to address these short comings. It refers to a Semantic Hub model to a create a linking structure to create canonical data. Wikipedia defines
“a canonical model is a design pattern used to communicate between different data formats.”
In other words, a structure to tie all of this disparate data together.
As data gets larger and more complex we are still struggling to find the best ways to structure and/or link disparate data. In many ways EDI and XML were attempts to apply the same thinking. The problem is not new, but it keeps evolving, which requires an evolving solution.
This article speaks pretty conceptually. I am interested to see a practical application of this line of thought. I hope you enjoy the article.