Drill, Baby, Drill: Self-Service Data Exploration using Apache Drill
Keshav Murthy
Senior Director, Product Management
MapR Technologies
www.mapr.com
 


 

Thursday, August 21, 2014
09:30 AM - 10:00 AM

Level:  Technical - Introductory


These days, companies drill into their data sets before they drill for gas or gold. That first drilling step already means analyzing huge volumes of data, but data variety is also a growing challenge.

JSON is a data format that can represent a variety of data types, so it is quickly becoming the lingua franca of data on the Internet for APIs, data exchange, data storage, and data processing. In the business intelligence world, however, SQL is the language used to analyze data. Hence, there are many “SQL-on-Hadoop” projects.

Traditional SQL isn’t friendly to non-relational data. Apache Drill unifies SQL with JSON, CSV, Parquet, and other complex/multi-structured data types in Hadoop. It enables interactive analytics using your favorite BI and analytics tools directly on your data, without ETL or upfront modeling requirements. In this talk, we’ll introduce Apache Drill as the technology to drive the audacious goal of bringing self-service data exploration natively to Hadoop, and discuss its use cases.
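
As a rough illustration of querying JSON in place with SQL, the sketch below builds a Drill query against a raw JSON file and wraps it in a request for Drill's REST query endpoint. The file path, the field names (name, address.city, address.state), and the localhost address are assumptions made up for this example, not details from the talk, and the REST interface shown belongs to later Drill releases.

```python
import json
import urllib.request

# Hypothetical file path and field names, chosen only for illustration.
# The nested field address.city is addressed directly, with no schema
# definition or ETL step before the query.
SQL = (
    "SELECT t.name, t.address.city AS city "
    "FROM dfs.`/tmp/customers.json` t "
    "WHERE t.address.state = 'CA'"
)


def build_drill_request(sql, base_url="http://localhost:8047"):
    """Build (but do not send) a POST request for Drill's REST query endpoint."""
    payload = json.dumps({"queryType": "SQL", "query": sql}).encode("utf-8")
    return urllib.request.Request(
        url=base_url + "/query.json",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def run_query(sql, base_url="http://localhost:8047"):
    """Send the query to a running Drillbit and return the parsed JSON reply."""
    with urllib.request.urlopen(build_drill_request(sql, base_url)) as resp:
        return json.loads(resp.read().decode("utf-8"))
```

Calling run_query(SQL) requires a Drillbit listening at the assumed address; build_drill_request(SQL) can be inspected without one.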


Keshava Murthy is the Senior Product Director for MapR. Previously, he was senior architect for IBM Informix NoSQL & SQL, with 20 years of experience in database design and development. He led the storage, query processing, and hybrid solutions work for MongoDB NoSQL API support in Informix. He has worked on the Sybase, Illustra, and Informix database systems. Keshav has developed features in NoSQL, SQL, spatial and geospatial, timeseries, distributed queries, and transaction management. Keshav has advised and enabled IBM clients and partners to design and develop applications using Informix. Keshav holds a bachelor's degree in Computer Science and Engineering from the University of Mysore, India.


   