Conference Speakers

  • Stream Processing
  • Event Driven
  • Real Time

Berlin 2019

October 7: Training
October 8 & 9: Conference

About the Speakers

Meet the experts from global companies like Airbus, Amazon, ING, Lyft, Netflix, Uber, and many more, who have built scalable streaming infrastructure and enterprise-grade applications.

Hear why and how they use Flink as the stream processing engine of choice for large-scale stateful applications, including real-time analytics, real-time search and content ranking, and fraud/anomaly/threat detection.

Zainab Abbas

PhD Student at KTH Royal Institute of Technology Stockholm

Zainab Abbas is a PhD student at the KTH Royal Institute of Technology, Stockholm, and the Université catholique de Louvain, Louvain-la-Neuve. She holds a joint master's degree in Distributed Systems from KTH, Stockholm, and the Polytechnic University of Catalonia (UPC), Barcelona. Her research focuses on performance optimization techniques for large-scale data, in particular stream processing using modern data stream processing engines such as Apache Flink.

Research

Introducing WinBro for Scalable Streaming Graph Partitioning

Adrian Ackva

Research Intern at KTH Royal Institute of Technology Stockholm

Adrian Ackva is a System Research Intern at Research Institutes of Sweden (RISE). He has a background in Business Information Systems and is about to finish M.Sc. degrees specializing in data-intensive computing at KTH Royal Institute of Technology Stockholm and University of Rennes 1. Before his master's studies, he worked as a technical consultant on different projects in Germany and England, helping clients make their infrastructure scalable and automated.

Research

Introducing WinBro for Scalable Streaming Graph Partitioning

Enrico Agnoli

Software Engineer at Workday

Enrico Agnoli is a Software Engineer at Workday. During the last 5 years, he has worked on multiple technical projects as a developer, tech lead and people manager at different stages. Current involvements:

- Technically leading, as architect and developer, the delivery of a new data streaming platform to support ML
- Investigating new technologies and delivering POCs for possible new tools/products, like streaming platforms, blockchain, auditability of machine learning models and data security
- Helping to organize events and raise awareness of various causes and nonprofit groups as part of the Workday Giving&Doing foundation

He studied at the Politecnico of Milan and moved to Germany right after to work first on Honda’s ASIMO humanoid robots, then on automation software in one of Europe's biggest datacenters for Amadeus, and finally for Workday, #1 Future Fortune company of 2018. He was Workday's innovator of the year in 2018 for a research project on blockchain.

Use Case

Multi-tenanted streams @Workday

Adil Akhter

Lead Engineer at ING

Adil Akhter is a functional programmer with a focus on distributed system engineering and data-intensive application architecture. He works at ING as a Lead Engineer and is involved in building a state-of-the-art Prediction Serving system. He is passionate about technology and interested in category theory, streaming analytics, and scalable machine learning infrastructure, among other topics. In his spare time, he hacks with Haskell and Idris, speaks at different conferences, or organises meetups.

Research

Deploying Stateful Functions-as-a-Service (FaaS) on Streaming Dataflows

Jesse Anderson

Managing Director at Big Data Institute

Jesse Anderson is a Data Engineer, Creative Engineer and Managing Director of Big Data Institute.

He works with companies ranging from startups to Fortune 100 companies on Big Data. This includes training on cutting edge technologies like Apache Kafka, Apache Hadoop and Apache Spark. He has taught over 30,000 people the skills to become data engineers.

He is widely regarded as an expert in the field and recognized for his novel teaching practices. Jesse has been published by O’Reilly and Pragmatic Programmers. He has been covered in prestigious publications such as The Wall Street Journal, CNN, BBC, NPR, Engadget, and Wired.

Use Case

Airbus makes more of the sky with Flink

Nikolas Anderson

Software Engineer at Uber

Nikolas is a software engineer on Uber's Driving Safety team, where he works on using sensor/context-derived insights to make inferences about events that Uber drivers experience and acting on this knowledge accordingly. Previously, Nikolas studied Mathematics and Statistics at the University of Chicago.

Use Case

Making Sense of Streaming Sensor Data: How Uber Detects On-trip Car Crashes

Steven Bairos-Novak

Software Engineer at Pinterest

Steven is a software engineer on the Data Processing Platform at Pinterest. He primarily works on Pinterest’s streaming platform, Xenon, and has helped Pinterest move from a Mesos-based micro-batch stream processing model to true streaming with Flink on YARN.

Operations

Building a Self-Service Streaming Platform at Pinterest

 

Marton Balassi

Self-employed Solutions Architect

Marton is a Flink PMC member and one of the first contributors to the streaming API. He drove big data adoption at around 50 customers as a Senior Solutions Architect at Cloudera between 2017 and early 2019. He has since opened his own consulting company and has been enjoying the freelancer lifestyle since spring.

Use Case

Image Processing with Flink

Niels Basjes

Principal IT Architect at bol.com

Niels Basjes (1971) has been working for bol.com since May 2008. Before that, he worked as a web analytics architect for Moniforce, and as an IT architect/researcher at the National Aerospace Laboratory in Amsterdam. Since the second half of the 1990s he has been working on processing problems that require scalability. Over the past 20 years he has applied these concepts in aircraft/runway planning, IT operations and web analytics, building reports for some of the biggest websites in the Netherlands. At bol.com, too, his primary focus is scalability problems, and he is responsible for a shift in thinking about data and the business value it contains. Niels designed and implemented many of the personalization algorithms that are in production today at bol.com. He studied Computer Science at the TU Delft and holds a business administration degree from Nyenrode University. Niels is an active open-source developer and one of the Apache Avro PMC members; he has authored his own projects ( https://github.com/nielsbasjes/ ) and contributed various improvements and bug fixes to projects like Hadoop, HBase, Pig and Flink.

Use Case

When ordering matters

Julia Bennett

Senior Data Engineer at Netflix

Julia Bennett is a member of the data engineering team for personalization at Netflix that delivers recommendations made for each user. The team is responsible for building the large-scale data processing used in training and scoring of the various machine learning models that power the Netflix UI experience. They have recently been working on moving some of the company’s core datasets from being processed in a once-a-day batch ETL to being processed in near real time using Apache Flink. Before joining Netflix, Julia completed her PhD in mathematics at The University of Texas at Austin.

Ecosystem

Streaming Event-Time Partitioning With Apache Flink and Apache Iceberg

Matthew Brookes

Master's Student at ETH Zurich

Matthew studied Computing at Imperial College London, which included a year-long exchange program at ETH Zurich. He wrote his Master's thesis as part of the Strymon group under the supervision of Vasia Kalavri and John Liagouris. In September 2019 he began a Backend Engineer position at Monzo, London. He is interested in stream processing and data-intensive applications.

Research

Moving on from RocksDB to something FASTER

Regina Chan

Senior Engineer at Goldman Sachs

Regina Chan is a Senior Engineer in the Data Architecture team at Goldman Sachs, building solutions to service the firm’s growing demand for data. She is one of the original members of the Data Lake team, having built it from the ground up, and has been leading the effort to rebuild it using Flink.

Ecosystem

Dynamically Generated Flink Jobs at Scale

Shuyi Chen

Staff Software Engineer at Uber

Shuyi Chen is a staff software engineer at Uber working on stream processing technology. He is also a committer for Apache Calcite and Apache Flink. Shuyi is the tech lead of Uber’s stream processing platform, which powers 1000+ streaming jobs at Uber. He has years of experience in storage infrastructure, data infrastructure, and Android and iOS development at both Google and Uber.

Operations

Towards building a unified platform for stream processing at Uber

Jamie Chittenden

Lead Data Architect at Emirates NBD

Jamie has been working with data for 13+ years, began his data career as an ETL developer, and now finds himself in the world of stream processing.
Over the past 4 years Jamie has been leading stream processing implementations, from the design of the dataflows, to building the engineering capabilities to support the implementation, across two banks, in two geographical locations.

Use Case

The role Stream Processing plays for an Opti Channel Experience

Michal Ciesielczyk

Machine Learning Engineer at Deep.BI

Michał Ciesielczyk is a Machine Learning Engineer at Deep.BI. He is responsible for researching, building and integrating machine learning tools with a variety of technologies including Scala, Python, Flink, Kafka, Spark, and Cassandra. Previously, he worked as an assistant professor at Poznan University of Technology, where he received a Ph.D. in computer science and was a member of a research team working on numerous scientific and R&D projects. He has published more than 15 refereed journal and conference papers in the areas of recommender systems and machine learning.

Use Case

Real-time Stream Analytics and Scoring Using Apache Flink, Druid & Cassandra at Deep.BI

Gyula Fora

Streaming Platform Engineer at King

Gyula is a senior engineer in the Streaming Platform team at King, working on delivering innovative streaming solutions to large-scale production affecting hundreds of millions of users around the globe. 

He works mainly on building and running King’s internal Streaming Platform that powers real-time use-cases across the organization and on tooling that makes large-scale streaming deployments efficient and maintainable.

Gyula grew up in Budapest where he first started working on distributed stream processing and later became a core contributor to the Apache Flink project. Gyula has been a speaker at numerous big data related conferences and meetups, talking about stream processing technologies and use-cases.

Operations

How to configure your streaming jobs like a pro

Marios Fragkoulis

Postdoctoral Researcher at Delft University of Technology

Marios Fragkoulis is a postdoctoral researcher at TU Delft, working on scalable stream processing. He holds a PhD in main memory data analytics from the Athens University of Economics and Business and an MSc degree from Imperial College London. Marios is the co-developer of dgsh, the directed graph shell.

Research

Deploying Stateful Functions-as-a-Service (FaaS) on Streaming Dataflows

Juan Gentile

Senior Software Engineer at Criteo

Juan is a senior software engineer at Criteo. He has worked with big data technologies for the last 8 years and is currently developing a rule-based engine for invalid traffic detection built on Flink.

Use Case

Large Scale Real Time Ad Invalid Traffic Detection with Flink

Kenny Gorman

Co-Founder and CEO at Eventador.io

Kenny has 18 years of experience with various database platforms behind some of the busiest datasets in the world. Most recently he co-founded ObjectRocket. He has had roles as Chief Technologist, Architect, Director, Manager, Developer, and DBA. He was a key member of the early teams that scaled PayPal and then eBay, ran one of the largest PostgreSQL installations on the planet, and was a very early adopter of MongoDB and an entrepreneur building on it. He is an active database community member, speaker, and evangelist.
Loves vi.

Use Case

Writing an interactive SQL engine and interface for executing SQL against running streams using Flink

Roman Grebennikov

Backend Engineer at Findify AB

Roman Grebennikov is a passionate software developer from Russia with hands-on experience in software development, the JVM and high-performance computation. In recent years he has focused on bringing functional programming principles and practices to real-world data analysis and machine-learning projects.

Operations

Extending Flink state serialization for better performance and smaller checkpoint size

Philipp Grulich

Research Associate at TU Berlin

Philipp is a Research Associate at Technische Universität Berlin and a PhD candidate supervised by Volker Markl. His research interests include data stream processing, query compilation, and the exploitation of modern hardware. Before joining TU Berlin, he worked for several companies and gained experience in frontend and backend software development. At the German Research Center for Artificial Intelligence, he worked as a research assistant on a streaming-systems research project involving Apache Flink.
He graduated with an M.Sc. in computer science from TU Berlin in March 2019. Prior to that, he received his B.Sc. degree from Hamburg University of Applied Sciences.

Research

Scotty: Efficient Window Aggregation with General Stream Slicing

Sijie Guo

Founder at StreamNative

Sijie Guo is the founder of StreamNative. StreamNative is an infrastructure startup, focusing on building cloud native event streaming systems around Apache Pulsar. Previously, he was the tech lead for the Messaging Group at Twitter, and worked on push notification infrastructure at Yahoo. He is also the VP of Apache BookKeeper and PMC Member of Apache Pulsar.

Ecosystem

Query Pulsar Streams using Apache Flink

Ma Guowei

Senior Technical Expert at Alibaba

2010 to present: Alibaba Inc.
2007 to 2010: Baidu Inc.

Technology Deep Dive

Flink’s New Batch Architecture

Vahid Hashemian

Software Engineer at Pinterest

Vahid is a Software Engineer and a member of the Logging Platform team at Pinterest. He is an Apache Kafka committer and focuses on enhancing the logging pipeline at Pinterest. He is currently building a platform for querying Kafka data streams using Flink. Previously, Vahid worked as a member of the Open Technologies organization at IBM.

Operations

Building a Self-Service Streaming Platform at Pinterest

Steffen Hausmann

Specialist Solutions Architect Analytics at Amazon Web Services

Dr. Steffen Hausmann is a Specialist Solutions Architect for Analytics with Amazon Web Services. He has a strong background in the area of complex event and stream processing and supports customers on their cloud journey. In his spare time, he likes hiking in the nearby mountains.

Ecosystem

Build and run streaming applications with Apache Flink and Amazon Kinesis Data Analytics for Java Applications

Fabian Hueske

Co-founder, Software Engineer at Ververica

Fabian Hueske is a committer and PMC member of the Apache Flink® project and has been contributing to Flink since its earliest days. Fabian is a co-founder of Ververica, a Berlin-based startup devoted to fostering Flink, where he works as a software engineer and contributes to Apache Flink®. He holds a PhD in computer science from TU Berlin and is currently writing a book about “Stream Processing with Apache Flink®”.

Community

One SQL to Rule Them All – a Syntactically Idiomatic Approach to Management of Streams and Tables

Vasiliki Kalavri

Postdoctoral Fellow at ETH Zurich

Vasia is a postdoctoral fellow at the Systems Group of ETH Zurich and will soon be moving to Boston University as an Assistant Professor of Computer Science. She is interested in distributed stream processing, large-scale graph analytics, and the intersection of the two. Vasia is a PMC member of Apache Flink and co-author of O’Reilly’s “Stream Processing with Apache Flink”.

Research

Self-managed and automatically reconfigurable stream processing

Jeyhun Karimov

Researcher at DFKI GmbH

I am a PhD student at TU Berlin and a researcher at DFKI.

Research

AStream: Ad-hoc Shared Stream Processing

Parag Kesar

Software Engineer at Pinterest

Experienced Software Engineer with 15 years of professional experience building scalable, distributed, high-performance web applications, backend services and big data applications. Experience working with high scale systems like Apple's IdMS and Ooyala's recommendation engine.
Languages - Java, Scala, Python
Big Data - Apache Spark, HBase, Elasticsearch, Couchbase NoSQL, Cassandra, Flink
Machine Learning - Content and Collaborative filtering algorithms for video recommendations based on Spark

Use Case

Real-time Experiment Analytics at Pinterest with Apache Flink

Dongwon Kim

Manager at SK Telecom

Dongwon Kim is a big data architect at SK Telecom. During his post-doctoral work, he was fascinated by the internal architecture of Flink and gave a talk titled “a comparative performance evaluation of Flink” at Flink Forward 2015. He introduced Flink to SK Telecom, SK energy, and SK hynix to meet the companies' various needs for real-time stream processing, and shared these experiences at Flink Forward 2017 and 2018. He has recently been working on a web service to promote the wide adoption of streaming applications companywide.

Use Case

Do Flink on Web with FLOW

Konstantin Knauf

Senior Solutions Architect at Ververica

As a Senior Solutions Architect at Ververica, Konstantin helps our clients to solve their business problems with Apache Flink and Ververica Platform. In this role, he is also one of the first people our customers turn to if a streaming application is not performing as expected. Before joining Ververica he worked as a Senior Consultant with TNG Technology Consulting, where he supported its clients mainly in the areas of Distributed Systems and Automation. Konstantin studied Mathematics and Computer Science at TU Darmstadt, specializing in Stochastics and Algorithmics.

Operations

Apache Flink Worst Practices

Aljoscha Krettek

Software Engineer at Ververica

Aljoscha Krettek is a co-founder at Ververica, where he works on the open-source Flink APIs. He is also a PMC member at Apache Flink and Apache Beam. Before working on Flink, he studied Computer Science at TU Berlin and worked at IBM Germany and at the IBM Almaden Research Center in San Jose. Aljoscha has spoken about stream processing and Apache Flink at Hadoop Summit, Strata, Flink Forward and several meetups.

Technology Deep Dive

Towards Flink 2.0: Unified Batch & Stream Processing

Aaron Levin

Infrastructure Engineer at Stripe

Aaron Levin is a mathematician-turned-radio-DJ-turned-software-engineer working on Stripe’s real-time data team (✨Streaming✨). Aaron used to live in Berlin, but now lives in Canada’s Berlin (Montréal - not to be mistaken with Berlin, Ontario).

Technology Deep Dive

A Tale of Dual Sources: Pictures of Grief and The Job Manager’s Clock

Bowen Li

Senior Engineer at Alibaba

Bowen is a committer of Apache Flink and Senior Software Engineer at Alibaba. He is currently focusing on advancing Flink as a unified data processing system and developing Flink's metadata and batch capabilities. Bowen is the host of the Seattle Flink Meetup; he frequently organizes meetups and events and gives talks on Flink.

Technology Deep Dive

Unify Enterprise Data Processing System: Platform-level integration of Flink and Hive

Ben Liu

Software Engineer at Pinterest

Ben is a software engineer at Pinterest focusing on large-scale data analytics, with work experience in Spark, Hive, Flink and HBase.
Before joining Pinterest, he graduated from Stanford University with an MS in Statistics and a background in computer science.

Use Case

Real-time Experiment Analytics at Pinterest with Apache Flink

Max Meldrum

Systems Research Engineer at RISE

Max Meldrum is a researcher and systems engineer at RISE SICS in Sweden. His interests lie in distributed systems and the areas they intersect with, namely dataflow processing frameworks (e.g., Flink), scheduling, and data management. Max is one of the core developers of Arcon, a distributed Rust-based dataflow runtime capable of executing stream and batch workloads efficiently at native hardware speeds.

Research

Introducing Arc: A common intermediate language for unified batch and stream analytics

Robert Metzger

Co-founder, Software Engineer at Ververica

Robert Metzger is a PMC member of the Apache Flink project and a co-founder and an engineering lead at Ververica. He is the author of many Flink components including the Kafka and YARN connectors. Robert studied Computer Science at TU Berlin and worked at IBM Germany and at the IBM Almaden Research Center in San Jose. He is a frequent speaker at conferences such as the Hadoop Summit, ApacheCon and meetups around the world.

Community

A year in the Apache Flink Community

How to contribute to Apache Flink

Maximilian Michels

Open-Source Software Engineer / Consultant

Max is a software engineer and PMC member of Apache Flink and Apache Beam. During his studies at Free University of Berlin and Istanbul University, he worked at Zuse Institute Berlin on Scalaris, a distributed transactional database. Inspired by the principles of distributed systems and open source, he helped to develop Apache Flink at Ververica and, in the course of that work, joined the Apache Beam community to create the Flink Runner. After maintaining the SQL layer of the distributed database CrateDB, he is now working on the portability aspects of Apache Beam.

Technology Deep Dive

Beam on Flink: How does it actually work?

Mike Mintz

Infrastructure Engineer at Stripe

Mike Mintz is a software engineer on Stripe’s Streaming team. Mike previously worked in the trading industry, where it was valuable to have a unified system for historical backtesting and live trading. Mike is originally from Anchorage, Alaska, but now lives in San Francisco.

Technology Deep Dive

A Tale of Dual Sources: Pictures of Grief and The Job Manager’s Clock

David Morin

Devops Big Data at OVH

David is a Big Data DevOps engineer in the Data Convergence team at OVH. He works on building architectures for OVH products around data (ingestion, analytics, storage, processing). He was introduced to Big Data with Hadoop 6 years ago and fell in love with its dynamic ecosystem. Since then he’s been working with every kind of system dealing with data, meeting loads of technical challenges along the way.

Use Case

Change data capture in production with Apache Flink

Roshan Naik

Technical Lead - Streaming Platform at Uber

Roshan is a technical lead on Uber's stream processing platform team (Athena), looking into problems of stream processing at scale. He was previously at Hortonworks, where he architected Storm 2.0's new high-performance execution engine and authored Hive's transactional streaming ingest APIs. He is a committer on Flume, Streamline and Storm. He is also the author of Castor, an open source C++ library that brings the Logic paradigm to C++.

Technology Deep Dive

Demystifying Flink Memory Allocation and tuning

Oleksandr Nitavskyi

Software Engineer at Criteo

More than 10 years of experience in the industry.
Currently part of the SRE Kafka team at Criteo, which builds the Streaming Platform.
Worked for Grammarly in the past. Likes JVM and functional programming. Fan of improving development productivity.

Operations

Introspection of the Flink in production

Piotr Nowojski

Software Engineer at Ververica

Piotr Nowojski is a Software Engineer at Ververica and a Flink committer working mostly on Flink’s runtime code. Previously, he was a Software Engineer at Teradata working on Presto, a distributed batch SQL query engine.

Technology Deep Dive

Faster checkpointing through unaligned checkpoints

Yann Pauly

Senior Software Engineer at OVH

Yann is a senior software engineer in the Data Convergence team at OVH, working on creating products around data ingestion, data lakes and analytics platforms. More focused on the backend side of things, he is passionate about API design, modularity and performance, a passion that he shares with his students as a teacher at the university in Brest, France.

Use Case

Change data capture in production with Apache Flink

Massimo Perini

MSc Student / Researcher at EIT

Massimo Perini is a graph analytics aficionado with deep scientific knowledge and engineering experience in the field. Massimo is currently researching online graph embedding techniques within a multi-MSc degree in Data Science from KTH in Sweden, Politecnico di Milano and Torino in Italy, while also holding a joint Computer Engineering BSc with Tongji University in China. He was a finalist at the Xilinx Open Hardware 2018 and the winner of the Italian Statistics and Probability Competition in 2013. His general interests lie in the fields of machine learning, big data and real-time data processing.

Research

Deep Stream Dynamic Graph Analytics with Grapharis

Wojtek Ptak

CTO at FreshMail

Wojtek works as FreshMail’s CTO and independent consultant and trainer. He loves various aspects of data-driven business culture transformations and development of data and software architectures in such companies. He is also a great supporter of fostering an organization-wide learning culture.

In FreshMail he leads the product development team and works on some core business solutions, often applying Machine Learning and AI to solve problems. There, together with the team, he works on a new generation of an anti-abuse engine (fighting spam, phishing, and other attacks) that uses data stream processing, ML & AI on the scale of tens of dozens of millions of emails every day. He is a creator of some ML/AI workshops, including public ones.

He co-founded the Ministry of Ideas, where for almost 10 years he consulted on organizations' data-driven transformations and the implementation of tools and processes supporting them. As a consultant and trainer he worked with, among others, The Coca-Cola Company, the American Bankers Association, Macy's, Bloomingdales, Heineken, Saks 5th Avenue, BP, Boots, Polo Ralph Lauren, Homebase, Porsche, HSBC, Intel and Oracle. Outside of his professional life, he is an enthusiast of mountain sports (downhill, enduro, free-touring, freeride snowboarding) as well as travel, expeditions and photography.

Use Case

Fighting phishing and spam with online machine learning on data streams

Jiangjie Qin

Staff Software Engineer at Alibaba

Jiangjie (Becket) is currently a software engineer at Alibaba, where he mostly focuses on the development of Apache Flink and its ecosystem. Prior to Alibaba, Becket worked at LinkedIn building streaming infrastructure around Apache Kafka, after receiving his master's degree from Carnegie Mellon University in 2014. Becket is a PMC member of Apache Kafka.

Technology Deep Dive

Run Interactive Queries with Apache Flink

Lakshmi Rao

Software Engineer at Lyft

Lakshmi is a software engineer on the streaming platform team at Lyft. The team builds and supports the core infrastructure that enables several product teams at Lyft to easily and reliably spin up Flink jobs to perform aggregations on real-time data. Most recently, she has been spending time re-architecting the platform to a Kubernetes based deployment. Prior to Lyft, Lakshmi worked in fin-tech land, building a search and information retrieval platform for Goldman Sachs.

Operations

Running Flink in production: The good, the bad and the in-between

Sören Reichardt

Software Engineer at Neo4j

Sören is a software engineer in the graph analytics team at Neo4j. His interests cover working with graphs in big data environments as well as query execution engines. Prior to joining Neo4j, he studied at Leipzig University and wrote his master's thesis about Cypher on Flink.

Other

Bringing Cypher to Apache Flink

Till Rohrmann

Engineering Lead at Ververica

Till is a PMC member of Apache Flink and engineering lead at Ververica. His main work focuses on enhancing Flink’s scalability as a distributed system. Till studied computer science at TU Berlin, TU Munich and École Polytechnique where he specialized in machine learning and massively parallel dataflow systems.

Technology Deep Dive

Flink’s New Batch Architecture

Leire Fernandez de Retana Roitegui

Senior Software Engineer at Workday

Leire has been a Software Engineer at Workday for the last 4 years, although she has been immersed in software development for over a decade, performing multiple roles as developer, tech lead and mentor.

Leire is passionate about building quality code, from conception through implementation, testing and delivery. As the newest member of the Data Streaming Platform team at Workday, she's excited to be given the opportunity to work with Apache Flink and explore the possibilities and challenges that it has to offer.

When not behind a computer, Leire enjoys ‘all things outdoors’, with a bit of circus arts on the side.

Use Case

Multi-tenanted streams @Workday

Francisco Javier Piqueras Ruiz

Big Data Architect at Indizen Technologies

Big Data changed my life. I started working with the elephant and his friends in 2013 in one of the first big data projects in Spain, for Deutsche Bank. Since then I have had the opportunity to work with several teams in different countries, from MapReduce through Spark and, since 2017, with Flink, designing and developing innovative solutions.

My current role is Big Data & Innovation Architect at Indizen Technologies, a Spanish company located in Madrid and Málaga that specializes in R&D for financial services.

Use Case

From BaaB to EaaS in the Financial Industry

Fares Sabbagh

Criteo

Fares Sabbagh

Senior Software Engineer at Criteo

Fares is a senior software engineer at Criteo. He started his career as a freelance web developer, then joined Criteo in 2015 to work as a software engineer on the Invalid Traffic Detection team.

Use Case

Large Scale Real Time Ad Invalid Traffic Detection with Flink

Christophe Salperwyck

Christophe Salperwyck

Freelance

Christophe Salperwyck

Machine Learning Engineer at Freelance

Christophe Salperwyck started as a software engineer and then moved to machine learning. He specialised in machine learning on streaming data during his PhD at Orange. He is also interested in designing algorithms that scale, such as CourboSpark, an adaptation of the Spark decision tree to time series, developed for EDF. There he also worked on creating a data lake for 30 years of historical power plant data, mainly in HBase: 1000B points/100 TB of data.

Ecosystem

FlinkDTW: time-series pattern search at scale using Dynamic Time Warping

Francisco José Guerrero Sánchez

Indizen Technologies

Francisco José Guerrero Sánchez

Principal Big Data & Solutions Architect at Indizen Technologies

Innovative technology enthusiast with a broad career (18+ years) as a technical expert and leader, lately focused on helping companies take advantage of big data technologies in their business.

Use Case

From BaaB to EaaS in the Financial Industry

Caito Scherr

Caito Scherr

Software Engineer at New Relic

Caito is a software engineer at New Relic. Caito loves woodworking, dance, and terrible puns.

Operations

Flinking, Fast and Slow

Klas Segeljakt

Klas Segeljakt

KTH

Klas Segeljakt

PhD Student at KTH

Klas Segeljakt is a next-gen compilers researcher and PhD student at KTH in Sweden, currently investigating the space of programming languages and hardware acceleration for data processing. He is known for his contributions to Arc, an intermediate representation aiming to bridge the worlds of batch and stream processing, independently of the frontend language (e.g., SQL) or the backend system executing the optimized code (e.g., Flink).

Research

Introducing Arc: A common intermediate language for unified batch and stream analytics

Gordon Tai

Tzu-Li (Gordon) Tai

Ververica

Tzu-Li (Gordon) Tai

Software Engineer at Ververica

Tzu-Li (Gordon) Tai is an Apache Flink PMC member and software engineer at Ververica. His main contributions to Apache Flink include work on some of the most widely used Flink connectors (Apache Kafka, AWS Kinesis, Elasticsearch). Gordon has spoken at conferences such as Flink Forward and Strata Data, as well as several Taiwan-based conferences on the Hadoop ecosystem and data engineering in general.

Operations

State Unlocked

Sherin Thomas

Sherin Thomas

Lyft

Sherin Thomas

Software Engineer at Lyft

Sherin is a Software Engineer at Lyft. In her career spanning 8 years, she has worked on most parts of the tech stack, but enjoys the challenges in Data Science and Machine Learning the most. Most recently she has been focussed on building products that facilitate advances in Artificial Intelligence and Machine Learning through Streaming.

She is passionate about getting more people, especially women, interested in the field of data and has been trying her best to share her work with the community through tech talks and panel discussions. Most recently she gave a talk about Flink Streaming at Connect 2019 (a Women Who Code event) in San Francisco.

In her free time she loves to read and paint. She is also the president of the Russian Hill book club based in San Francisco and loves to organize events for her local library.

Use Case

Enabling Machine Learning with Apache Flink

Oytun Tez

Oytun Tez

MotaWord

Oytun Tez

CTO at MotaWord

Oytun is the co-founder and CTO of MotaWord, the world’s fastest business translation platform. Having majored in linguistics, he is a software engineer by vocation. He developed an interest in collaborative workflows, which MotaWord implements fully, and in the automation of human collaboration. His most recent toys are Apache Flink, inline skating and kites.

Use Case

Not So Big – Flink as a true Application Framework

Jonas Traub

Jonas Traub

TU Berlin

Jonas Traub

Research Associate at TU Berlin

Jonas is a Research Associate at Technische Universität Berlin and the German Research Center for Artificial Intelligence (DFKI). His research interests include data stream processing, sensor data analysis, and data acquisition from sensor nodes. Jonas authored several publications related to data stream gathering, processing and transmission in the Internet of Things and will complete his PhD in March 2019 under the supervision of Volker Markl. Before he started his PhD, Jonas wrote his master's thesis at the Royal Institute of Technology (KTH) and the Swedish Institute of Computer Science (SICS) / RISE in Stockholm under the supervision of Seif Haridi and Volker Markl, advised by Paris Carbone and Asterios Katsifodimos. Prior to that, he received his B.Sc. degree at Baden-Württemberg Cooperative State University (DHBW Stuttgart) and worked several years at IBM in Germany and the USA. Jonas is an alumnus of "Software Campus", "Studienstiftung des deutschen Volkes" and "Deutschlandstipendium".

Research

Scotty: Efficient Window Aggregation with General Stream Slicing

Antonio Verardi

Antonio Verardi

Yelp

Antonio Verardi

Software Engineer - Infrastructure at Yelp

Writing code and tinkering with computers for a living, writing code and tinkering with computers for fun. Still uncertain whether he’s a Software Engineer, a Systems Engineer or a Software Reliability Engineer, keeps telling people he’s one of the computer guys at Yelp. Mainly interested in distributed systems and stream processing, has a taste for open-source software.

Operations

Kubernetes + Operator + PaaSTA = Flink @ Yelp

Timo Walther

Timo Walther

Ververica

Timo Walther

Software Engineer at Ververica

Timo Walther is a committer and PMC member of the Apache Flink project. He studied Computer Science at TU Berlin. Alongside his studies, he participated in the Database Systems and Information Management Group there and worked at IBM Germany. Timo works as a software engineer at Ververica. In Flink, he is mainly working on the Table & SQL API.

Technology Deep Dive

Flink's Table & SQL API: Can you keep up?

Bo Wang

Alibaba Group

Bo Wang

Senior Engineer at Alibaba Group

I am a senior engineer at Alibaba Group and have worked on the Alibaba Big Data Processing Platform for over 3 years. My work mainly focuses on distributed computing, stream computing, and distributed resource management. I have designed and developed the Alibaba Distributed Computing Platform, which has been deployed across hundreds of thousands of nodes in production, supporting millions of business jobs every day.

Technology Deep Dive

Towards More Efficient and Adaptive Scheduling for Flink Batch

Shaoxuan Wang

Alibaba

Shaoxuan Wang

Senior Staff Engineer at Alibaba

Shaoxuan is a senior staff engineer and director of engineering at Alibaba, working on Flink SQL and the AI platform. Prior to Alibaba, Shaoxuan was a senior software engineer working on social network and core infrastructure at Facebook. Shaoxuan received his Ph.D. degree from UC San Diego. He is an Apache Flink committer and PMC member.

Technology Deep Dive

Build a Flink AI Ecosystem

Wei Wang

Wei Wang

HanSight

Wei Wang

Senior Engineer at HanSight

I hold a master's degree from Chongqing University and am currently a senior big data engineer at HanSight. I am mostly interested in applying machine learning technologies to fast, accurate anomaly detection in stream processing systems. I’m currently researching how to build a flexible AutoML process based on big data processing frameworks. I’m also the main contributor to our company's UEBA product.

Use Case

Flink SQL Powered AutoML Pipeline

Patrick Wiener

Patrick Wiener

FZI Research Center for Information Technology

Patrick Wiener

Research Scientist at FZI Research Center for Information Technology

Patrick Wiener currently works at the FZI Research Center for Information Technology in Karlsruhe. His research interests include Distributed Computing (Cloud, Edge/Fog Computing), IoT, and Stream Processing. Patrick is an expert in infrastructure management, such as containers and container orchestration frameworks. He has worked on several publicly funded research projects related to Big Data Management and Stream Processing in domains such as logistics and geographical information systems.

Ecosystem

Flink for Everyone: Self-Service Data Analytics with StreamPipes

Seth Wiesman

Seth Wiesman

Ververica

Seth Wiesman

Solutions Architect at Ververica

Seth Wiesman is a Solutions Architect at Ververica, where he works with engineering teams inside of various organizations to build the best possible stream processing architecture for their use cases.

Operations

State Unlocked

Hao Wu

Hao Wu

HanSight

Hao Wu

Big Data Architect at HanSight

University of Waterloo alumnus (2012) with a master's degree in software engineering, and a former Flink Forward Berlin 2017 speaker. I’m a senior big data processing architect and currently the leader of UEBA product development at HanSight, the leading cyber security company in China and the only Asian vendor in Gartner Peer Insights “Voice of the Customers” SIEM Customers’ Choice 2019. My skills span multiple big data processing frameworks (e.g., Flink, Spark, Kafka, ZooKeeper), data-intensive application design, and machine learning technologies. Currently I’m focusing on powering the machine learning process with an AutoML architecture that enhances feature reusability, feature standardization, consistency of model training/serving, and user experience, and that as a result fills the gap between data engineering and data science.

Use Case

Flink SQL Powered AutoML Pipeline

Jin Yang

Jin Yang

Uber

Jin Yang

Software Engineer at Uber

Jin is a software engineer on Uber’s Driving Safety team. In particular, she works with safety-related activities that happen on-trip, including detecting distracted driving behavior and potential car crashes. She has a Computer Science degree from the University of Southern California. Previously, she worked at Mercedes-Benz R&D North America to collect streaming telematics data for business insight and product improvement. Both inside and outside her work, she enjoys cultivating her interest in driving, cars, and machine learning.

Use Case

Making Sense of Streaming Sensor Data: How Uber Detects On-trip Car Crashes

Kurt Young

Kurt Young

Alibaba

Kurt Young

Staff Engineer at Alibaba

I work on the real-time compute team at Alibaba and mostly focus on building a unified, high-performance SQL engine based on Apache Flink.

Technology Deep Dive

What's new in 1.9.0 blink planner

Andrey Zagrebin

Andrey Zagrebin

Ververica

Andrey Zagrebin

Software Engineer at Ververica

Andrey Zagrebin is a Software Engineer at Ververica. Andrey’s work focuses primarily on Apache Flink’s distributed coordination and state backends. Previously, he worked as a Software Engineer at T-Mobile, building a large-scale infrastructure for batch and real-time analytics of customer experience. Before that, he worked at LinkResearchTools, where he developed an SEO web crawler, and at Qubit Digital, where he built multiple distributed streaming applications.

Technology Deep Dive

Time-To-Live: How to perform Automatic State Cleanup in Apache Flink

Philipp Zehnder

Philipp Zehnder

FZI Research Center for Information Technology

Philipp Zehnder

Research Scientist at FZI Research Center for Information Technology

Philipp Zehnder is a research scientist at the FZI Research Center for Information Technology and a PhD student at the Karlsruhe Institute of Technology (KIT). Philipp holds a master's degree in Computer Science from KIT. He was a student assistant at FZI, where he worked on the ProaSense FP7 project. His current research interests are in the areas of Distributed Stream Processing and Streaming Machine Learning. He received a Microsoft Azure for Research Award for his current research work focused on the development of distributed machine learning pipelines.

Ecosystem

Flink for Everyone: Self-Service Data Analytics with StreamPipes

Sebastian Zontek

Sebastian Zontek

Deep.BI

Sebastian Zontek

CEO, CTO at Deep.BI

Sebastian Zontek is the CEO, CTO and co-founder of Deep.BI, a predictive customer data platform with real-time user scoring. He is an experienced IT systems architect with particular emphasis on the production use of open source systems for big data such as Flink, Cassandra, Hadoop, Spark, Kafka, and Druid in BDaaS (Big Data as a Service), SaaS (Software as a Service), and PaaS (Platform as a Service) solutions. Previously, he was CEO and main platform architect at Advertine, whose network matched product ads with user preferences, predicting purchasing intent using ML and NLP techniques.

Use Case

Real-time Stream Analytics and Scoring Using Apache Flink, Druid & Cassandra at Deep.BI