Book Review “Data Architecture: A Primer for the Data Scientist” by W.H. Inmon / D. Linstedt
The book "Data Architecture: A Primer for the Data Scientist" by W.H. Inmon / D. Linstedt contains the subtitle "Big Data, Data Warehouse and Data Vault" which summarizes pretty well the main focus of the book.The first chapter introduces and defines structured and...
DataBeat week 51/2014
Reference to blogs, tweets, discussions, etc that caught my attention during the last week.Data ModelingThe blog article "Data Vault 2.0 Staging Area learnings & suggestions" by Roelant Vos shows an approach to generate hash keys for Data Vault 2.0 in the staging...
DataBeat week 50/2014
Reference to blogs, tweets, discussions, etc that caught my attention during the last week.Data ModelingBlog post, link to web session and source code on how to use BIML to generate Data Vault. BIML (Business Intelligence Markup Language) is a XML dialect for defining...
DataBeat week 49/2014
Reference to blogs, tweets, discussions, etc that caught my attention during the last week.Data ModelingData Modeling and NoSQL? Data Modeling gets more and more important in the "schema-less" world because a suitable data model ensures data quality, performance, and...
DataBeat – week 48/2014
Reference to blogs, tweets, discussions, etc that caught my attention during the last week.Data ModelingPart 2 of the article about loading Data Vault 2.0 using SAS DI Studio. The referenced LinkedIn discussion also contains some feedback from D. Linstedt. A very...
Oracle Enterprise Manager Database Express 12c
EM 12c replaces Enterprise Manager Database Control from Oracle 12c onwards and serves basic Oracle database management and monitoring functionalities for a single database:ConfigurationStorageSecurityPerformanceIt's a lightweight tool and offers only a subset of the...
Commercial Hadoop distributions and their components
The following table shows commercial distributions and their Apache stack and some proprietary components. Hortonworks HDP 2.1 Cloudera CDH 5.0.1 MapR 4.0.0B Pivotal HD 2.0 IBM BigInsights 2.1.2 Hadoop/Yarn 2.4.0 2.3.0 2.3.0 2.2.0 2.2.0 Tez 0.4.0 Pig 0.12.1...
High-level overview of some Columnar Stores
Columnar Stores exist since a long time. There are well-known players like Vertica, Sybase IQ or Exasol. Recent trends like Cassandra and HBase or announcements from SAP, IBM, Microsoft and Oracle caused some hype for (in-memory) columnar stores. The term "columnar...
Beurer Half Marathon 2013
Beurer Half Marathon (+ Einstein-Marathon and some other distances) took place in Ulm on 29-Sep-2013. The weather was perfect. It was almost windless - just a bit cold during the waiting before start. Daimer TSS sponsored my participation and also organized food and...
Links to Oracle database white papers
Links to some Oracle database white papers: Best Practices For Implementing High Volume IoT workloads with Oracle Database 12c - April 2017 Manageability with Oracle Database 12c - June 2014 Upgrading to Oracle Database 12c - August 2013 Consolidation Best Practices:...
Google’s F1 database for AdWords
BigData - who does not think of Google who released white papers on Bigtable, Google File System (today: Colossus) and MapReduce. These papers contained lots of ideas that influenced the development of Apache Hadoop.On what kind of databases is...
OT: Beurer Half Marathon 2011
Beurer Half Marathon (+ Einstein-Marathon and other distances) took place in Ulm on 18-Sep-2011. The weather was nasty: rainy and windy - the only positive mention it was not hot. The event was again perfectly organized with one huge improvement compared to the last...