PyData London 2023

Ian Ozsvald

Ian is a Chief Data Scientist who has helped co-organise the annual PyDataLondon conference, raising $100k+ annually for the open source movement, along with the associated 11,000+ member monthly meetup. Using data science, he's helped clients find $2M in recoverable fraud, created the core IP which opened funding rounds for automated recruitment start-ups, and diagnosed how major media companies can better supply recommendations to viewers. He gives conference talks internationally, often as a keynote speaker, and is the author of the bestselling O'Reilly book High Performance Python (2nd edition). He has over 25 years of experience as a senior data science leader, trainer and team coach. For fun he's walked by his high-energy Springer Spaniel, surfs the Cornish coast and drinks fine coffee. Past talks and articles can be found at:

Sessions

06-03
11:00
40min
Executives at PyData
Ian Ozsvald

Executives at PyData is a facilitated discussion session for executives and leaders to discuss challenges around designing and delivering successful data projects, organizational communication, product management and design, hiring, and team growth.

We'll announce the agenda at the start of the session. You can ask questions or raise issues to get feedback from other leaders in the room, NumFOCUS board members, and Ian and James.

Organized by Ian Ozsvald (London) and James Powell (New York)

Beaumont
06-03
15:45
40min
Pandas 2, Dask or Polars? Quickly tackling larger data on a single machine
Giles Weaver, Ian Ozsvald

Pandas 2 brings new Arrow data types, faster calculations and better scalability. Dask scales Pandas across cores. Polars is a new competitor to Pandas designed around Arrow with native multicore support. Which should you choose for modern research workflows? We'll solve a "just about fits in RAM" data task using all three tools, talking through the pros and cons of each. You'll leave with a clear idea of whether Pandas 2, Dask or Polars is the tool for your team to invest in.

Salisbury
06-03
16:30
40min
Discussing Higher Performance Python (Birds of a Feather session)
Ian Ozsvald

This discussion session is for anyone using Python for higher performance work. You probably use Pandas, NumPy, Polars, Dask, Vaex, Modin, cuDF or one of the related tools. You've got questions, and you want to know what other people are using, what's pragmatic and where new opportunities might exist.
This will be a guided discussion: we'll vote on topics at the start of the session, and then Ian, as host, will work through the list.

Beaumont