INTRODUCTION TO DATABRICKS
MARCH 31 – APRIL 2, 2025 | VILNIUS, LITHUANIA
Are you curious about cutting-edge data processing technologies or looking to deepen your knowledge in Data Engineering? Join our Introduction to Databricks workshop, designed to guide you step-by-step through this powerful modern platform. Whether you’re a beginner eager to learn or an experienced data professional exploring new tools, this session is perfect for you.
WHO IS THIS COURSE FOR?
This workshop is for engineers who would like to get their hands dirty with building Data Pipelines using DataBricks platform. Understanding python will be helpful yet not crucial.
WHAT YOU NEED TO PREPARE?
You must bring a laptop with an e-mail access and a working up-to-date web browser. The workshop will take place on a Web environment from Databricks – no development environment, SDKs, or libraries will be needed on your machine.
WHAT WILL YOU GET FROM THIS COURSE?
In this workshop, you’ll gain a comprehensive understanding of Databricks’ core components and its potential for building an efficient data platform. From mastering the Data Lakehouse concept to managing users and optimizing platform costs, we’ll cover everything you need to get started and succeed. Take the first step toward elevating your data processing skills with us!
WHO IS THIS COURSE FOR?
This workshop is for engineers who would like to get their hands dirty with building Data Pipelines using DataBricks platform. Understanding python will be helpful yet not crucial.
WHAT YOU NEED TO PREPARE?
You must bring a laptop with an e-mail access and a working up-to-date web browser. The workshop will take place on a Web environment from Databricks – no development environment, SDKs, or libraries will be needed on your machine.
WHAT WILL YOU GET FROM THIS COURSE?
In this workshop, you’ll gain a comprehensive understanding of Databricks’ core components and its potential for building an efficient data platform. From mastering the Data Lakehouse concept to managing users and optimizing platform costs, we’ll cover everything you need to get started and succeed. Take the first step toward elevating your data processing skills with us!
agenda
INTRODUCTION TO DATABRICKS
- DataBricks UI overview
- Workspaces: Data Science, Data Engineering, SQL
- Databricks Architecture
SPARK WALKTHROUGH
- Spark Architecture
- File Formats
- Jobs, Stages, Tasks, executors
DATAFRAME API
- Reading data and Writing Data
- Aggreagations and Joins
- How to read dataframe operations in sparkUI
- SparkSQL
DATASOURCES
- Delta Lake
- SQL databases
- Remote servers
DELTA LAKE
- Introduction to DeltaLake and LakeHouse architecture
- ETL with Delta Live Tables
- Optimizing Delta lake
PRODUCTION USAGE
- Deploying workloads
- Testing pipelines
- Managing Access (Unity Catalog)
- Managing resources and cost
agenda
INTRODUCTION TO DATABRICKS
- DataBricks UI overview
- Workspaces: Data Science, Data Engineering, SQL
- Databricks Architecture
SPARK WALKTHROUGH
- Spark Architecture
- File Formats
- Jobs, Stages, Tasks, executors
DATAFRAME API
- Reading Data and Writing Data
- Aggreagations and Joins
- How to read dataframe operations in sparkUI
- SparkSQL
DATASOURCES
- Delta Lake
- SQL databases
- Remote servers
DELTA LAKE
- Introduction to DeltaLake and LakeHouse architecture
- ETL with Delta Live Tables
- Optimizing Delta lake
PROCUCTION USAGE
- Deploying workloads
- Testing pipelines
- Managing Access (Unity Catalog)
- Managing resources and cost
get your ticket
Join this 3-day workshop, or unlock the full potential of Databricks by opting for the Introduction to Databricks course combined with the Apache Spark and Databricks Performance Tuning course as a comprehensive 5-day learning package. Held from March 31 to April 4, this combination provides a seamless progression from foundational concepts to advanced optimization strategies. Learn more about the Apache Spark and Databricks Performance Tuning course HERE. For any questions, contact us at tickets@bigdataconference.eu or call +370 618 00999.
Ticket Information
In order to provide an invoice or a Proforma invoice for Full ticket, we would be grateful to have this information provided by email at tickets@bigdataconference.eu:
- Company details (Registration code, VAT, Address)
- Type of ticket (Full Ticket)
- Number of tickets
- Email of the attendee(s)
- Workshop title
If you have any other questions, please call +370 695 65000.
Ticket Information
To get the Proforma invoice issued, please choose Proforma invoice in Payment type field in Paysera.
If you have any other questions, please call +370 695 65000.
WORKSHOP STARTS IN:
Day(s)
:
Hour(s)
:
Minute(s)
:
Second(s)
3 DAY WORKSHOP
TICKET
INTRODUCTION TO DATABRICKS
2499 €/ excl. VAT
5 DAY WORKSHOP
TICKET
INTRODUCTION TO DATABRICKS + APACHE SPARK AND DATABRICKS PERFORMANCE TUNING
3999 €/ excl. VAT
meet the speaker
Marcin Szymaniuk is the CEO and Senior Data Engineer at TantusData, as well as an internationally recognized conference speaker. With over two decades of experience in helping clients monetize big data, Marcin leads a team of expert data engineers specializing in Data Engineering, Machine Learning (ML), ML-Ops, and Cloud technologies. He excels in solving both complex, unconventional challenges and more routine problems that require fast and efficient solutions. Marcin’s extensive experience spans various industries and project scales, with a particular focus on AI, ML, and deployment. His speaking engagements have included notable events such as Infoshare, J On the Beach, Devoxx, Huawei Eco-Connect Poland 2023, Berlin Buzzwords, Codestar, GeeCON, and Java Day Istanbul.
meet the speaker
Marcin Szymaniuk is the CEO and Senior Data Engineer at TantusData, as well as an internationally recognized conference speaker. With over two decades of experience in helping clients monetize big data, Marcin leads a team of expert data engineers specializing in Data Engineering, Machine Learning (ML), ML-Ops, and Cloud technologies. He excels in solving both complex, unconventional challenges and more routine problems that require fast and efficient solutions. Marcin’s extensive experience spans various industries and project scales, with a particular focus on AI, ML, and deployment. His speaking engagements have included notable events such as Infoshare, J On the Beach, Devoxx, Huawei Eco-Connect Poland 2023, Berlin Buzzwords, Codestar, GeeCON, and Java Day Istanbul.