The Data Thread 2022

In Case You Missed It Recordings from The Data Thread are available now!

WATCH HERE
Free Virtual Conference with 25+ sessions

Members of the Apache Arrow ecosystem came together to exchange insights and move the community forward.

The Data Thread LIVE CONTENT
Keynote by Wes McKinney and Jacques Nadeau, co-creators of Apache Arrow. High performance computing panel. Fireside chat with Anaconda CEO Peter Wang and Voltron Data CEO Josh Patterson. Q&A session with the featured speakers.

The Data Thread RECORDED CONTENT
Technical talks covering a wide variety of topics within the Arrow ecosystem.

Keynotes:

Wes McKinney

Co-Founder & CTO, Voltron Data

Read Bio »

Jacques Nadeau

Co-Founder & CEO, Sundeck

Read Bio »

Featured Speakers:

Peter Wang

Peter Wang

Chief Executive Officer, Anaconda Inc.

Josh Patterson

Josh Patterson

Co-Founder & CEO, Voltron Data

James Pivarski

Jim Pivarski

Computational Physicist, Princeton University

Sebastián Estévez

Sebastián Estévez

Principal Engineer, Datastax

Carlos Maltzahn

Carlos Maltzahn

Founder & Director, University of California - Santa Cruz (CROSS)

Fernanda Foertter

Fernanda Foertter

Director of DevRel and HPC Business Development, Voltron Data

Rodrigo Aramburu

Rodrigo Aramburú

Chief Product Officer, Voltron Data

Darren Haas

Darren Haas

Chief Business Officer, Voltron Data

Jing Brewer

Jing Brewer

Vice President of Product Strategy, Voltron Data

Morgan Mahlock

Morgan Mahlock

Director of Product Strategy, Voltron Data

Marlene Mhangami

Marlene Mhangami

Developer Advocate, Voltron Data

Session

A: Navigating the San Francisco Art Scene with Ibis

Watch my session

B: An Introduction to Arrow for Python Programmers

Watch my session

Recorded Speakers:

Vibhatha Abeykoon

Vibhatha Abeykoon

Software Engineer, Voltron Data

Session

Acero: An Arrow native C++ streaming query engine

Zaid Al-Ars

Zaid Al-Ars

Director of Software Engineering, Voltron Data

Session

Maximizing the Performance of DNA Analysis Using Apache Arrow

Arthur Andres

Arthur Andres

Software Developer, Tradewell Technologies

Session

Put Your Cassandra Python Driver On Steroids With Apache Arrow

Will Ayd

Will Ayd

Owner, innobi

Session

A Developers' Journey Using Arrow with Tableau

Jayjeet Chakraborty

Jayjeet Chakraborty

Ph.D. Student, University of California - Santa Cruz. Department of Computer Science and Engineering

Session

Embedding Apache Arrow inside Storage Systems

Patrick Clarke

Patrick Clarke

Product Manager, Voltron Data

Session

What Is Ibis + Simple Demo

Phillip Cloud

Phillip Cloud

Principal Engineer, Voltron Data

Session

How to Get Rid of Stringly-Typed Analytics

Ian Cook

Ian Cook

Director of Product Management, Voltron Data

Session

Arrow and Substrait: Better Together

Nicola Crane

Nic Crane

Software Engineer, Voltron Data

Session

Contributing to the Arrow R Package - Get Involved!

Andrew Crotty

Andrew Crotty

Assistant Professor, Northwestern University

Session

Everyone Should Use Apache Arrow for Data Systems Research

Dewey Dunningham

Dewey Dunnington

Senior R Developer, Voltron Data

Session

Accelerating Geospatial Computing in R and Python Using Apache Arrow

James Duong

James Duong

Lead Software Developer, Bit Quill

Session

Arrow Flight SQL: Accelerating Database Access

Henry Ehrenberg

Henry Ehrenberg

Co-Founder, Snorkel AI

Session

Powering Data-Centric AI with Arrow

Venkatesh Emani

Venkatesh Emani

Senior Scientist, Microsoft

Session

PyFroid: Scaling Data Preparation Using Database

Alenka Frim

Alenka Frim

Open Source Apprentice, Voltron Data

Session

How to Use the New Contributor’s Guide to Start Contributing to Apache Arrow

Bogdan Ghit

Bogdan Ghit

Tech Lead & Senior Software Engineer, Databricks

Session

Building a High-Throughput Data Extract Architecture

Jason Hughes

Jason Hughes

Director of Product Management, Dremio

Session

Apache Arrow Flight SQL- Enabling Universal ODBC & JDBC Drivers

Li Jin

Li Jin

Software Developer, Two Sigma

Session

Time Series Data Transformation with Arrow Compute Engine

Oliver Kennedy

Oliver Kennedy

Associate Professor at University, Buffalo

Session

Microkernel Notebooks

Sutou Kouhei

Sutou Kouhei

Co-Founder & President, ClearCode

Session

Why Apache Arrow is Important for Ruby

Andrew Lamb

Andrew Lamb

Database Engineer, InfluxData

Session

Apache Arrow and DataFusion: Changing the Game for Implementing Database Systems

Jorge Leitão

Jorge Leitão

Co-founder and Data Scientist, Munin Data

Session

Design an Arrow Library to be `Async`

David Li

David Li

Software Engineer, Voltron Data

Session

Arrow Flight SQL: Accelerating Database Access

Tianyu Li

Tianyu Li

PhD Student, Massachusetts Institute of Technology

Session

Mainlining Databases: Supporting Fast Transactional Workloads on Apache Arrow

Leo Meyerovich

Leo Meyerovich

Founder, Graphistry, Inc.

Session

Building the FIrst GPU Visual Graph AI Platform with End to End Apache Arrow

Thomas Mock

Thomas Mock

Customer Enablement Lead, RStudio

Session

Efficient Data Analysis on Larger-than-Memory Data with DuckDB and Arrow

Dominik Moritz

Dominik Moritz

Faculty at Carnegie Mellon University Human-Computer Interaction Institute and ML Researcher, Apple.

Session

Apache Arrow on the Web and Beyond

John Murray

John Murray

Director, Fusion Data Science and Visiting Professor, Data Science Lab, University of Liverpool

Session

Using Arrow, with Numba KerneIs, to Generate AI Workflows

Danielle Navarro

Danielle Navarro

Developer Advocate, Voltron Data

Session

Doing More with Data: An Introduction to Arrow for R Users

Weston Pace

Weston Pace

Software Engineer, Voltron Data

Session

Acero: An Arrow native C++ streaming query engine

Pedro Pedreira

Pedro Pedreira

Software Engineer, Meta

Session

Velox: An Open-Source Unified Execution Engine

Hussain Sultan

Hussain Sultan

Field Engineering Director, Voltron Data

Session

Ibis and Substrait: Standardized Analytics

Paul Taylor

Paul Taylor

Senior Software Engineer, NVIDIA

Session

Apache Arrow on the Web and Beyond

Radu Teodorescu

Radu Teodorescu

Vice President Data Technology, WorldQuant

Session

A New Hope For The Big Data Divergence

Matt Topol

Matt Topol

Staff Software Engineer, Voltron Data

Session

GraphQL and Apache Arrow: A Match Made in Data

Joris Van den Bossche

Joris Van den Bossche

Software Engineer, Voltron Data

Session

Accelerating Geospatial Computing in R and Python Using Apache Arrow

Matthias Vallentin

Matthias Vallentin

CEO and Co-Founder, Tenzir

Session

When Data Engineering Meets Security Analytics

Wenlei Xie

Wenlei Xie

Research Scientist, PyTorch at Meta

Session

Torch Arrow Performant ML Preprocessing

Randy Zwitch

Randy Zwitch

Head of Developer Relations, Streamlit

Session

All in on Apache Arrow