Workshops - Big Data Conference Europe 2021

Workshop No:

1

Developing Performant Data Streaming Applications Using Kafka

Carlos Manuel Duclos-Vergara

In this workshop we will look deeper into the architecture of Kafka in order to understand how we can get the best performance. During the course of this workshop we will build some applications in order to highlight the different performance outcomes based on the way we design the application.

Workshop No:

2



Spark and HADOOP

Lidor Gerstel

The Workshop will cover basic concepts of Hadoop and mostly in The Cloudera stack, like using HBase & Impala to query data, using Spark to stream data, afterwards we will launch a Cloudera quickstart, using datasets of top-rated movies in the workshops, getting the data analyzed and queried with Hadoop, explaining & demonstrating Map Reduce Concepts, RDD Partition on Spark.

Workshop No:

3



ONNX runtime to serve AI models

Mauro Bennici

The workshop will cover the basics of a Machine Learning project, from start to production release. We will concentrate on the optimization part of the chosen model. You will learn to use the ONNX Runtime to serve the model, check the performance compared to the initial model, and use a programming language different from the starting one.

Workshop No:

4



Improving Performance and Security in MySQL

Lukas Vileikis

This workshop will cover the things that developers and DBAs can do to improve security in MySQL by mainly covering security-related issues pertaining to MySQL, but also putting some performance aspects into the mix – people will learn how to secure their MySQL instances and keep them performing at the best of their ability at the same time.

Workshop No:

5



An introduction to FluxLang

Riccardo Tommasini

Flux is a lightweight data scripting language for fast-prototyping streaming and time-series databases. It is maintained by InfluxData, i.e., the company behind the most popular time-series database. This half-day course provides an introduction to the InfluxDB 2.0 and It covers fundamentals about time series analysis and stream processing. Central to the course is the use of Fluxlang by InfluxData. The course will introduce you to Flux core concepts and it will make use of Influx Cloud free tier.

BIG DATA CONFERENCE

EUROPE 2021

Online Edition

WORKSHOPS LIST

Developing Performant Data Streaming Applications Using Kafka

Spark and HADOOP

ONNX runtime to serve AI models

Improving Performance and Security in MySQL

An introduction to FluxLang