Posts

Showing posts from 2021

GROUP BY ALL - Databricks

Using SQL to analyze data in Azure Data Lake (ADLS Gen 2)

As more and more data is ingested and made available in data lakes, there is growing demand from data analysts to quickly access that data and drive insights. The biggest driver of this demand is the ability to use existing skills like "SQL" to analyze the available data. In Azure, the following are a few of the options available in the Big Data world for making the data available to data analysts:

    1. Mounting Azure Data Lake (ADLS Gen 2) to Databricks.
    2. Implementing the Databricks Lakehouse pattern, built on top of Delta tables, which are available as tables in Databricks.
    3. Loading data into an Azure Synapse workspace.

One good thing about the options above is that they all enable data analysts to use their existing "SQL" skills to analyze the data, and give them the flexibility to build solutions and products as quickly as possible, which in turn enables " citiz