Archives of #Nbsp

PyConUK 2013

I’m just finishing with PyConUK, it has been a fun 3 days (and the sprints carry on tomorrow). Yesterday I presented a lightly tweaked version of my Brand Disambiguation with scikit-learn talk on natural language processing for social media processing. I had 65 people in the room (cripes!), 2/3 had used ML or NLP for […]

Some Natural Language Processing and ML Papers

After I spoke at DataScienceLondon in June I was given a set of paper references by a couple of people (the bulk were by Levente Török) – thanks to all. They’re listed below. Along the same lines I have one machine learning paper aimed at beginners to recommend (“A Few Useful Things to Know about […]

Visualising London, Brighton and the UK using Geo-Tweets

Recently I’ve been grabbing Tweets some some natural language processing analysis (in Python using NetworkX and NLTK) – see this PyCon and PyData conversation analysis. Using the London dataset (visualised in the PyData post) I wondered if the geo-tagged tweets would give a good-looking map of London. It turns out that it does: You can […]

Analysing #pydata, London and Brighton tweets for concept mapping

Below I’ve visualised tweets for #PyData conference and the cities of London and Brighton – this builds on my ‘concept cloud‘ from a few days ago at the #PyCon conference. Props to Maksim for his Social Media Analysis tutorial for inspiration. Update – Maksim’s Analying Social Networks tutorial video is online. For the earlier #PyCon […]

Testing 3 modern face detection libraries (, openCV, libccv)

As a research project months back Balthazar and I tested 3 modern face detection libraries (definitely see Balthazar’s write-up). had just been acquired by facebook, they had a great and free service which annotated not just face locations but also sex, age and emotion. We also tested OpenCV (popular and free) and the lesser […]

aMaking “from lxml import etree” work with virtualenv (Python)

Update – these steps are overly complicated and *unnecessary*! See fizyk and Marius’ comments below. I’ll leave this post just in case it helps anyone – hopefully anyone coming here will realise it isn’t hard (now) to install lxml, as long as the OS dependencies are installed I use virtualenv for all development. Recently I […]

Kinect depth maps and Python

I had the opportunity to play with a Kinect over the weekend, I wanted to test out depth mapping using the built in infra red cameras. Using a structured light approach is different to the stereopsis approach I was looking at with Kyran recently. Using the open source drivers for Ubuntu I quickly got the […]

Demos for Botanical Garden Label Matcher from StrongSteam

After a fair bit of graft we’ve finished our first product using StrongSteam – a Latin Botanical Garden label matcher (AKA “OpenPlants”) which runs at Kew Gardens, Wakehurst Place and other botanical gardens in Europe that use the usual black rectangular labels. If you’re not sure what I’m talking about then these 30 second demo […]

Data mining/AI/robots/hackerspace meet-up this Thursday

This Thursday at 7pm our StrongSteam will run a friendly pub meet around: Data mining Artificial Intelligence (AI) Robots Hackerspaces The goal is to bring people together from StartupChile and the local community who are interested in the above subjects. The meeting is just a pub meetup, if there’s demand then I’ll organise speakers for […]