Track Overview

Modern Data Architectures, Pipelines, & Streams

Data engineering is the practice of delivering high-fidelity, custom access to data in order to serve the varied needs of a business. The rich and engaging experiences many of us expect online today (e.g., personalized news feeds, highly relevant search engines and recommender systems, smart home assistants) are powered by the modern data pipelines and architectures that form the foundation of data engineering.

This field continues to evolve rapidly thanks to the efforts of players who push systems to new heights or develop new patterns of usage that become the standards of tomorrow. How should you think about your data engineering problems? Come to this track to learn more.


From this track

Session + Live Q&A Data Streams

Building & Operating High-Fidelity Data Streams

Monday Nov 8 / 11:10AM EST

The world we live in today is fed by data. From self-driving cars and route planning to fraud prevention, to content and network recommendations, to ranking and bidding, our world not only consumes low-latency data streams, it adapts to changing conditions modeled by that data. While...

Sid Anand

Chief Architect @Datazoom, PMC @ApacheAirflow

Session + Live Q&A Data Streams

Microservices to Async Processing Migration at Scale

Monday Nov 8 / 12:10PM EST

Netflix creates and analyzes operational and analytical data associated with playback of thousands of titles by over 200 million members worldwide. The data powers product features such as members’ ability to see and manage their viewing history. The data also feeds into the core business...

Sharma Podila

Software Engineer @Netflix

Session + Live Q&A Big Data

Protecting User Data via Extensions on Metadata Management Tooling

Monday Nov 8 / 01:10PM EST

In a world where data collection is ever-increasing and new and expanded data protection laws like GDPR and CCPA are introduced yearly, metadata management, the act of storing contextual information about collected and stored data, has become a required staple for many companies. This talk gives...

Alyssa Ransbury

Security Engineer @Square

PANEL DISCUSSION + Live Q&A Data Streams

Managing Data at Scale

Monday Nov 8 / 02:10PM EST

Since the advent of the internet, the need for reliable, low-latency access to data has grown at a rapid pace. Data infrastructure, which was once a single monolithic database, has evolved into a tapestry of point solutions tied together by data movement infrastructure (e.g. data replication...

Mark Grover

Co-founder @Stemma_ai & co-creator of Amundsen

Shirshanka Das

Founder of LinkedIn DataHub, Apache Gobblin, Acryl Data

Chris Riccomini

Distinguished Engineer @WePay


Speakers from this track

Sid Anand

Chief Architect @Datazoom, PMC @ApacheAirflow

Sid Anand currently serves as the Chief Architect for Datazoom. Prior to joining Datazoom, Sid served as PayPal's Chief Data Engineer, focusing on ways to realize the value of data. Prior to joining PayPal, he held several positions including Agari's Data Architect, a Technical Lead in...


Sharma Podila

Software Engineer @Netflix

Software engineering leader, system builder, collaborator, and mentor with deep expertise in cloud resource management, distributed systems, and data infrastructure. Proven track record of delivering impactful, large-scale distributed systems of cross-functional scope.


Alyssa Ransbury

Security Engineer @Square

Alyssa Ransbury is a Security Engineer at Square. She supports and leads data security engineering efforts across dozens of product teams and is interested in finding data where it shouldn’t be.


Mark Grover

Co-founder @Stemma_ai & co-creator of Amundsen

Mark is the co-founder of Stemma. He is the co-creator of the leading open-source data catalog, Amundsen, used by Lyft, Instacart, Square, ING, Snap and many more! Mark was previously a developer on Apache Spark at Cloudera and is a committer and PMC member on a few open-source Apache...


Shirshanka Das

Founder of LinkedIn DataHub, Apache Gobblin, Acryl Data

Shirshanka is co-founder and CEO of Acryl Data, the company commercializing the open-source DataHub project, a real-time metadata platform used by LinkedIn, Expedia, Saxo Bank, Klarna, Viasat, and many others. Prior to founding Acryl, he was the overall architect for...


Chris Riccomini

Distinguished Engineer @WePay

Chris Riccomini is a software engineer, startup investor, and advisor with more than a decade of experience at major tech companies such as PayPal, LinkedIn, and WePay. He has been involved in open source throughout his career and is the author of Apache Samza. He's recently written The...


Track Date

Monday Nov 8 / 11:00AM EST

Topics

Big Data


Track Host

Sid Anand

Chief Architect @Datazoom, PMC @ApacheAirflow
