About

Ian Ozsvald picture

This is Ian Ozsvald's blog (@IanOzsvald), I'm an entrepreneurial geek, a Data Science/ML/NLP/AI consultant, founder of the Annotate.io social media mining API, author of O'Reilly's High Performance Python book, co-organiser of PyDataLondon, co-founder of the SocialTies App, author of the A.I.Cookbook, author of The Screencasting Handbook, a Pythonista, co-founder of ShowMeDo and FivePoundApps and also a Londoner. Here's a little more about me.

High Performance Python book with O'Reilly View Ian Ozsvald's profile on LinkedIn Visit Ian Ozsvald's data science consulting business Protecting your bits. Open Rights Group

21 January 2014 - 23:43PyData London conference keynotes and topics coming together

We’ve got our keynoters for PyData London (Feb Fri 21- Sun 23):

  • Gael Varoquaux (INRIA) with “Building a Cutting-Edge Data Processing Environment on a Budget”
  • Felix Fernandez (Deutsche Börse) with “Python in the Financial Industry: The Universal Tool for End-to-End Development”

Gael is a core committer for scikit-learn and Felix is the Business CIO for the Cash & Derivatives IT department.

Along with the keynoters we have a nice set of talks lining up (the Call for Proposals is open!), the topics will probably include:

  • Javascript frameworks for data visualisation
  • Statistical approaches to problem solving
  • An economist’s view into data science
  • Art in the realm of Big Data
  • Data clustering techniques
  • High performance Python processing

Our Call for Proposals is open until the end of the month, I’m really keen to see stories and tutorials around the solving of interesting problems with data. Whilst the conference is themed for Python I’m keen to see proposals that use other languages (or no language – art & stats!) to do interesting things around data. I’m more focused on interesting topics and lively discussion. Are there any R, Matlab and Julia users who’d like to share their experience?

Please do consider putting forward a proposal, the conversation that will come out of the conference looks to be rather interesting already. If you’ve never spoken at a conference before then this would be a rather ideal place to start.


Ian applies Data Science as an AI/Data Scientist for companies in ModelInsight, sign-up for Data Science tutorials in London. Historically Ian ran Mor Consulting. He also founded the image and text annotation API Annotate.io, co-authored SocialTies, programs Python, authored The Screencasting Handbook, lives in London and is a consumer of fine coffees.

8 Comments | Tags: Python

14 January 2014 - 23:05Installing the numpy module in PyPy

Working on the High Performance Python book (mailing list here for our occasional announces) I’ve reinstalled PyPy a couple of times, each time I forget how to install the numpy module. Note that PyPy’s numpy is different and much smaller than CPython’s numpy. It does however work for smaller problems if you just need some of the core features (i.e. not the libs that numpy wraps). It used to be included in a branch, now it comes as a separate package.

I’m posting this as a reminder to myself and maybe as  a bit of help to another intrepid soul. The numpy PyPy install instructions are in this Nov 2013 blog post. You need to clone their numpy repo and then install it as a regular module using the “setup.py” that’s provided (it takes just a couple of minutes and installs fine). Download PyPy from here and just extract it somewhere.

Having installed it I can:
$ ../bin/pypy 
Python 2.7.3 (87aa9de10f9c, Nov 24 2013, 18:48:13)
[PyPy 2.2.1 with GCC 4.6.3] on linux2
Type "help", "copyright", "credits" or "license" for more information.
And now for something completely different: ``it seems to me that once you
settle on an execution / object model and / or bytecode format, you've already
decided what languages (where the 's' seems superfluous) support is going to be
first class for''
>>>> import numpy as np
>>>> np.__version__
'1.8.0.dev-74707b0'

From here I can use the random module and do various vectorized operations, it isn’t as fast as CPython’s numpy for the Pi example I’m working but it does work. Does anyone know which parts offer comparable speed to its bigger brother?


Ian applies Data Science as an AI/Data Scientist for companies in ModelInsight, sign-up for Data Science tutorials in London. Historically Ian ran Mor Consulting. He also founded the image and text annotation API Annotate.io, co-authored SocialTies, programs Python, authored The Screencasting Handbook, lives in London and is a consumer of fine coffees.

8 Comments | Tags: High Performance Python Book, Python

13 January 2014 - 18:50PyData London (Feb 21-23 2014)

PyData is coming to London, this’ll be the first PyData in Europe. The conference will focus on Python for Data Analytics, quite like SciPy and EuroSciPy but with a bit more of a focus on business rather than science (but only a bit, I rather like the science).

170 videos from past conferences are available free online, all the content from this conference will go online afterwards. Obviously a big part of the conference is meeting people in-person, it’ll be held in Level39 in Canary Wharf at One Canada Place.We’re looking for talk submissions – the timeline is short so please respond to the Call for Proposals by the end of Jan (that’s 2.5 weeks away). Submissions around data analysis, “big data” (“interesting data” is more what I’d like to see) and visualisation would be ace, tools like numpy, Pandas, Cython, Numba, IPython, Vincent etc obviously are all very relevant.

Sponsorship for a London event is rather reasonable, if you’re looking to hire or raise your profile to London data/Python people then contact Leah via the Admin address on the sponsorship page, you’ll get good exposure for less than the cost of hiring a recruiter and you’ll have direct access to people who help write the tools that we all use.


Ian applies Data Science as an AI/Data Scientist for companies in ModelInsight, sign-up for Data Science tutorials in London. Historically Ian ran Mor Consulting. He also founded the image and text annotation API Annotate.io, co-authored SocialTies, programs Python, authored The Screencasting Handbook, lives in London and is a consumer of fine coffees.

7 Comments | Tags: Python