Clean and Simple

Clear your calendar - It's going down! Text Blocks kicks off on May 20th, and you're invited to take part in the festivities. Splash HQ (122 W 26th St) is our meeting spot for a night of fun and excitement. Come one, come all, bring a guest, and hang loose. This is going to be epic!

Speaker Name

Job Title

Company Name

Speaker Name

Job Title

Company Name

Tune druid clusters at high scale

In this meetup, we’re diving deeper into the newest real-time analytics database on the market: Druid.

We’ll discuss how to tune druid clusters at high-scale (several million events per second)

and how to run queries quickly that can handle this high traffic.

You’ll hear from our expert speakers from ironSource, Lyft, and Imply about how each company deploys druid and creates the best architectures for this cutting-edge technology.

 *Please note that this event will be in English

 See you there!


RSVP now
Text goes here
X

when

Tuesday,

March 29, 18:00 Israel Time

where

Hybrid event:

You choose between ironSource HQ or online 👩‍💻

ironSource HQ in Tel Aviv (Azrieli Sarona Tower)

121 Derech Menachem Begin | Floor 12



Our speakers

Elad Eldor

Data Infrastructure Team Leader at ironSource

Elad Eldor is a Data Infrastructure Team Leader at ironSource, working mainly with Druid, Kafka, Presto and Spark on AWS. He has 12 years of experience as a Java software engineer and 5 years as a SRE in big data linux-based clusters. Before joining ironSource, Elad was a SRE at Verint (currently Cognyte), where he developed big data applications (using Spark, Hadoop and Kafka) and handled the reliability and scalability of Spark and Kafka clusters in production. His main interests are JVM tuning, performance tuning, and cost reduction of big data clusters (Kafka, Druid, Spark, Presto).


Jonathan Kaplan

Data Infrastructure Engineer at ironSource

Jonathan Kaplan is a Data Infrastructure Engineer at ironSource, specializing in performance tuning and deployments of Big Data technologies such as Druid, Redshift, and EMR (Trino and Spark) on AWS. Prior to joining ironSource, Jonathan was a DBA Team Lead at the Israeli Military Intelligence, and a Data Engineering Team Lead as part of the IDF Covid-19 Task Force with the Israeli Ministry of Health.

Tianyu Hong

Software Engineer, Data Infrastructure team at Lyft

Tianyu Hong works as a software engineer on the Data infrastructure team at Lyft. He mainly works on Druid and Trino, enabling real-time data querying as well as ETL pipelines. He holds a Master's degree in Electrical & Computer Engineering from Carnegie Mellon University.

Rachel Pedreschi

VP of Community & Developer Relations at Imply

Rachel Pedreschi is the VP of Community & Developer Relations at Imply. A "Data Geek-ette”, Rachel is no stranger to the world of high-performance databases and data warehouses. She is a Vertica, Informix and Redbrick certified DBA on top of her work with Cassandra and has 20+ years of business intelligence and ETL tool experience. Rachel has an MBA from San Francisco State University and a BA in Mathematics from University of California, Santa Cruz.

Agenda

18:00

Welcome everyone!

Cheers and beers 🍻

18:30

 The Rise of Immediate Intelligence 

Led by Rachel Pedreschi,  VP of Community & Developer Relations @ Imply


Decision making is changing: Apache Druid is a new type of database for creating the next generation of analytics applications that maximize flexible exploration over fresh, fast-arriving data. In this talk, Rachel Pedreschi introduces these new "immediate intelligence" applications, tells the story of Druid's emergence, and describes how data pipelines built with Druid differ from those you may already be familiar with.


18:50

Know Your Data


Led by Jonathan Kaplan from ironSource’s Data Infrastructure Team

The performance of our largest internal Druid cluster (in terms of incoming traffic) started to degrade, and tuning Druid infrastructure parameters didn’t work. It forced us to take a different approach, which we call "Data first, Tuning second".

19:15

Making Druid Realtime

Led by Elad Eldor from ironSource’s Data Infrastructure Team


In our busiest internal Druid cluster (in terms of concurrent queries) queries were very slow. We’ll describe how query performance significantly improved by tuning the Druid infrastructure.

19:45

Break

20:00

Building Data Pipelines using Druid at Lyft

Led by Tianyu Hong, Software Engineer, Data Infrastructure Team @ Lyft


In this talk, we'll learn more about how Lyft builds data pipelines using Apache Druid, which is useful for several use cases including metrics tracking, model forecasting, and internal tools. We'll also talk about the challenges we faced while setting up our real-time ingestion pipeline into Druid using Apache Flink and Kafka, and how we went about solving them. 

20:20

Giveaway time 🎁

Sign up now!
Text goes here
X

Where?

Arrival instructions:

To get to our offices, please go through the building's main lobby (floor 0) and pass through the turnstiles to get to the elevators. Head to Elevator Group A and go to floor 12.

*You must bring an ID/Drivers license and a mask to enter the building

*If you're driving, type "Arie Luba Eliav" into Waze to find the paid parking lot, and park at "Azrieli Sarona". Please note, there is a parking fee.

**COVID-19 safety measures**

Masks required

Event will be indoors


[confirmation_headline]
[confirmation_messaging]
Add to Calendar
Text goes here
X
Share with Friends
Facebook
Twitter
LinkedIn
Link
CONTACT THE ORGANIZER
Google   Outlook   iCal   Yahoo
Sorry, RSVPs have closed.