INTRODUCTION TO DATABRICKS

MARCH 31 – APRIL 2, 2025 | VILNIUS, LITHUANIA

Are you curious about cutting-edge Data processing technologies or looking to deepen your knowledge in Data Engineering? Join our Introduction to Databricks workshop, designed to guide you step-by-step through this powerful modern platform. Whether you’re a beginner eager to learn or an experienced Data professional exploring new tools, this session is perfect for you.

WHO IS THIS COURSE FOR?

This workshop is for engineers who would like to get their hands dirty building Data Pipelines on the Databricks platform. Familiarity with Python will be helpful but is not essential.

WHAT DO YOU NEED TO PREPARE?

You must bring a laptop with e-mail access and a working, up-to-date web browser. The workshop will take place in a web environment provided by Databricks, so no development environment, SDKs, or libraries will be needed on your machine.

WHAT WILL YOU GET FROM THIS COURSE?

In this workshop, you’ll gain a comprehensive understanding of Databricks’ core components and its potential for building an efficient Data platform. From mastering the Data Lakehouse concept to managing users and optimizing platform costs, we’ll cover everything you need to get started and succeed. Take the first step toward elevating your Data processing skills with us!

agenda

INTRODUCTION TO DATABRICKS

  1. Databricks UI overview
  2. Workspaces: Data Science, Data Engineering, SQL
  3. Databricks Architecture

SPARK WALKTHROUGH

  1. Spark Architecture
  2. File Formats
  3. Jobs, Stages, Tasks, Executors

DATAFRAME API

  1. Reading Data and Writing Data
  2. Aggregations and Joins
  3. How to read DataFrame operations in Spark UI
  4. Spark SQL

DATASOURCES

  1. Delta Lake
  2. SQL databases
  3. Remote servers

DELTA LAKE

  1. Introduction to Delta Lake and Lakehouse architecture
  2. ETL with Delta Live Tables
  3. Optimizing Delta Lake 

PRODUCTION USAGE

  1. Deploying workloads
  2. Testing pipelines
  3. Managing Access (Unity Catalog)
  4. Managing resources and cost

get your ticket

Join this 3-day workshop, or unlock the full potential of Databricks by opting for the Introduction to Databricks course combined with the Apache Spark and Databricks Performance Tuning course as a comprehensive 5-day learning package. Held from March 31 to April 4, this combination provides a seamless progression from foundational concepts to advanced optimization strategies. Learn more about the Apache Spark and Databricks Performance Tuning course HERE. For any questions, contact us at tickets@bigdataconference.eu or call +370 618 00999.

Ticket Information

To receive an invoice or a proforma invoice for a Full Ticket, please email the following information to tickets@bigdataconference.eu:

  • Company details (Registration code, VAT, Address)
  • Type of ticket (Full Ticket)
  • Number of tickets
  • Email of the attendee(s)
  • Workshop title

If you have any other questions, please call +370 695 65000.

To have a proforma invoice issued, select Proforma invoice in the Payment type field in Paysera.

3-DAY WORKSHOP TICKET

INTRODUCTION TO DATABRICKS

2499 € (excl. VAT)

5-DAY WORKSHOP TICKET

INTRODUCTION TO DATABRICKS + APACHE SPARK AND DATABRICKS PERFORMANCE TUNING

3999 € (excl. VAT)

meet the speaker

Marcin Szymaniuk is the CEO and Senior Data Engineer at TantusData, as well as an internationally recognized conference speaker. With over two decades of experience in helping clients monetize Big Data, Marcin leads a team of expert Data engineers specializing in Data Engineering, Machine Learning (ML), ML-Ops, and Cloud technologies. He excels in solving both complex, unconventional challenges and more routine problems that require fast and efficient solutions. Marcin’s extensive experience spans various industries and project scales, with a particular focus on AI, ML, and deployment. His speaking engagements have included notable events such as Infoshare, J On the Beach, Devoxx, Huawei Eco-Connect Poland 2023, Berlin Buzzwords, Codestar, GeeCON, and Java Day Istanbul.

 

workshop venue