The Zettabyte challenge
IDC published a White Paper about the challenge of Big Data Volume in a data-driven world. IDC expects that the data volume will grow from 45 Zettabyte (ZB) in 2020 to 175 ZB in 2025. The data will be produced in various forms like transactional data, text, voices,...
Columnar analytical databases for DWH and Data Analytics
The German magazine BI Spektrum published my article on analytical databases for DWH and Data analytics. The article discusses the characteristics of columnar databases and some analytical database categories. This blog contains a very brief summary....
Q&A on Data Integration and Big Data
Roberto Zicari did a Q&A with me about Data Integration and Big Data. Covered topics are Data integration, Big Data architecture, ETL, SQL, Hadoop, Data Lake, Data Catalog, Data Quality, education. The interview is available on odbms.org with the following...
NoSQL, NewSQL, cloud-native databases
The first NoSQL databases were created in the 2000s. Companies like Google, Amazon, Twitter & Co have developed their own databases for their specific needs. Over time, many of these databases were made available as open source. This blog post gives an overview of...
JSON and ISO SQL Standard
JSON was initially developed to exchange data via RESTful APIs (Representative State Transfer Application Programming Interface). The encoding is always Unicode, mostly UTF8. Programmable Web contains a variety of links to APIs like Twitter, LinkedIn, Strava, GitHub....
DOAG 2018
The annual DOAG 2018 conference took place from 20-NOV-2018 to 23-NOV 2018 in Nuremberg. As usual, the conference was excellent with a comprehensive community schedule. Core database topics are still covered by the majority of sessions but also with a focus on trends...
DOAG Big Data Days 2018
DOAG Big Data Days 2018 took place in Dresden from 20-Sep-2018 to 21-JUN-2018 with talks around Data capital, Data catalog, Streaming, Kafka, Data Lake, visualization, and geodata. There was also a hands-on workshop about Big Data SQL and connectors. This blog post...
Getting started with Oracle Autonomous Data Warehouse (ADW)
The blog post shows screenshots of how to set up an Oracle Autonomous Data Warehouse (ADW) service in the Oracle Cloud. The blog post contains three parts: Create the ADW service Client Credentials Client configuration (SQL Developer, sqlplus) Parameter list ...
TDWI Munich 2018 – AI, Data Catalog and Automation&Agility
TDWI 2018 took place in Munich from 25-JUN-2018 to 28-JUN-2018. This blog post summarizes some of my impressions on the topics AI, Data Catalog and Automation&Agility. Artificial Intelligence AI was the main topic in many sessions. I joined Barry Devlin's...
Metadata Management reloaded
Metadata is data about data. It used to be so simple with RDBMS and defined schemas. Variety creates the biggest challenge to derive metadata on-the-fly. What about schema-on-read? Schema-on-read applies the schema when data is retrieved.
Does Big Data stand in the way to derive value from data?
„Big Data is nothing but a marketing campaign that was designed to put money in the pockets of technology vendors and their collaborators.“ (Stephen Few: Big Data, Big Dupe – Analytics Press 2018, page 31)
Cloud computing: on-premise vs on-premises (grammar issues)
I'm not a native speaker and this post itself may contain grammar errors. So this post about grammar issues seems to be at the wrong place. But with all the hype around cloud, it looks like to me that many have adopted a wrong usage of a technical term: premise vs...