GOODS – How to post-hoc organize the Data Lake

In a recent paper, “Goods: Organizing Google’s Datasets”, Google describes its approach to post-hoc metadata management. GOODS (GOOgle Dataset Search) is a system that crawls internal storage systems (e.g. GoogleFS, Bigtable,...
