Andrus Adamchik: DataFrame – a Swiss Army Knife of Java Data Processing

Wednesday, June 12

Details
5:30-6:00: Food, Soda, Beer and Networking

6:00-6:15: Announcements

6:15-7:45: DataFrame – a Swiss Army Knife of Java Data Processing

As Java developers, we do a lot of data processing. If you have terabytes pumped through your system daily, maybe you would reach for Spark, Flink or some other “big data” solution. But there are also many everyday tasks that do not warrant the complexity of traditional data pipelines. Some examples are analysis of app logs, cleaning up and persisting Excel files, simple ETL copying tables between different databases, etc. So, how can you use “big data” techniques without big data infrastructure?

This talk will focus on “DataFrame” – an in-memory table-like data structure with operations including column / row filtering and transformations, joins, aggregations, etc. I will use an open source DFLib library (https://dflib.org) and Jupyter notebook to demonstrate how to do data processing in any Java app without much fuss.

About Andrus Adamchik

Andrus is a passionate open-source developer and a member of the Apache Software Foundation. He started programming in Java back in 1998, and founded a number of open-source projects: Apache Cayenne – a developer-friendly ORM, Bootique.io – a lightweight Java app platform, Agrest.io – a framework for dynamic REST services, and DFLib – DataFrame for Java. In his day job, Andrus is an IT entrepreneur, running a software company called ObjectStyle.

You can find Andrus on Twitter @andrus_a.

7:45: Door prizes

This entry was posted in Uncategorized. Bookmark the permalink.

Comments are closed.