Entrepreneurial Geekiness

“Flying Pandas” and “Making Pandas Fly” – virtual talks this weekend on faster data processing with Pandas, Modin, Dask and Vaex

This Saturday and Monday I’ve had my first experience presenting at virtual conferences – on Saturday it was for Remote Pizza Python (brilliant line-up!) and on Monday (note – this post predates the talk, I’ll update it tomorrow after I’ve spoken) at BudapestBI. UPDATE: added a 2nd variant of Making Pandas Fly for a short-notice PyDataUK […]

Ian

4 years ago

Recent “week notes”

I’ve not done a public “week notes” before. I’ve been hacking on various things and I figure it is worth sharing some of it. Using public Companies House data I’ve started to plot the decline in new company formations in the UK. Here’s a first crack, which shows a decline at the end of March. […]

Ian

4 years ago

New Higher Performance Python class (June 1-3)

I’ve listed my next Higher Performance Python public class; it’ll run online for 3 mornings on June 1-3 during UK hours. We’ll use Zoom and Slack with pre-distributed Notebooks and modules, and you’ll run it using an Anaconda environment. Here’s the write-up from my recent class. We’ll focus on Profiling to find what’s slow in […]

Ian

4 years ago

Notes on last week’s Higher Performance Python class

Last week I ran a two-morning Higher Performance Python class. We covered: profiling slow code (using a 2D particle infection model in an interactive Jupyter Notebook) with line_profiler & PySpy; vectorising code with NumPy vs running the original with PyPy; moving to Numba to make iterative and vectorised NumPy really fast (with up to a […]

Ian

4 years ago

Notes from Zoom call on “Problems & Solutions for Data Science Remote Work”

On Friday I held an open Zoom call to discuss the problems and solutions posed by remote work for data scientists. I put this together as I’ve observed from my teaching cohorts and from conversations with colleagues that for anyone “suddenly working remotely” the process has typically not been smooth. I invited folk to join […]

Ian

4 years ago

Another Successful Data Science Projects course completed

A week back I ran the 4th iteration of my 1-day Successful Data Science Projects course. We covered: how to write a Project Specification including a strong Definition of Done; how to derisk a new dataset quickly using Pandas Profiling, Seaborn and dabl; building interactive data tools using Altair to identify trends and outliers […]

Ian

4 years ago

Higher Performance Python (ODSC 2019)

Building on PyDataCambridge last week I had the additional pleasure of talking on Higher Performance Python at ODSC 2019 yesterday. I had a brilliant room of 300 Pythonic data scientists at all levels who asked an interesting array of questions. This talk expanded on last week’s version at PyDataCambridge as I had some more time. […]

Ian

4 years ago

Training Courses for 2020 Q1 – Successful Data Science Projects & Software Engineering for Data Scientists

Early next year I’ll run new iterations of two of my existing training courses for Pythonic Data Scientists: Successful Data Science Projects (Jan, 1 day) and Software Engineering for Data Scientists (Feb, 2 days). Successful Data Science Projects focuses on reducing uncertainty in a new data science project. We’ll look at the reasons why these projects […]

Ian

4 years ago

“Higher Performance Python” at PyDataCambridge 2019

I’ve had the pleasure of speaking at the first PyDataCambridge conference (2019); this is the second PyData conference in the UK after PyDataLondon (which colleagues and I co-founded 6 years back). I’m super proud to see PyData spread to 6 regional meetups and now 2 UK conferences. We had over 200 attendees and the conference […]

Ian

4 years ago

“A starter data science process for software engineers” – talk at PyLondinium 2019

I’ve just spoken on “A starter data science process for software engineers” (slides linked) at PyLondinium 2019. This talk is aimed at software engineers who are starting to ask data-related questions and who are starting a data science journey. I’ve noted that many software engineers – without a formal data science background – are […]

Ian

5 years ago

