Archives of #Wikipedia

Visualising the internals of Logistic Regression on a Text Matrix

Below I have some plots that visualise the term matrix (as a binary matrix and as a TF-IDF matrix) for the brand disambiguation project followed by a visualisation of the coefficients used in scikit-learn’s LogisticRegression classifier using l1 and l2 penalties. Using a CountVectorizer with binary=True we can mark the absence or presence of a […]

Thoughts from a month’s backpacking honeymoon

I’m publishing this on the hoof, right now we’re in Istanbul near the end of our honeymoon back home. Here are some app-travelling notes (for our Nexus 4 Androids). Google Translate offers Offline dictionaries for all the European languages, each is 150mb. We downloaded new ones before each country hop. Generally they were very useful, […]

Kinect depth maps and Python

I had the opportunity to play with a Kinect over the weekend, I wanted to test out depth mapping using the built in infra red cameras. Using a structured light approach is different to the stereopsis approach I was looking at with Kyran recently. Using the open source drivers for Ubuntu I quickly got the […]

Demos for Botanical Garden Label Matcher from StrongSteam

After a fair bit of graft we’ve finished our first product using StrongSteam – a Latin Botanical Garden label matcher (AKA “OpenPlants”) which runs at Kew Gardens, Wakehurst Place and other botanical gardens in Europe that use the usual black rectangular labels. If you’re not sure what I’m talking about then these 30 second demo […]

PyCon 2012 notes from the end

PyCon 2012 is just coming to a close. There were over 2,200 people here and too many talks to choose between. It was a bloody fine conference. Meeting so many of the Names of the Python world was rather grand, teaching High Performance Computing and getting pats on the back for the creation of ShowMeDo […]

strongsteam – an “AppStore for A.I. and data mining tools”

Kyran and I are starting work on a new project – strongsteam offers a web API with artificial intelligence and data mining tools. The goal is to make it easy for you to do things like: get the text out of images using optical character recognition determine whether two images look the same and if […]

Printable local data sheet for visitors?

Here’s a simple idea to help visitors to a new area.  Maybe it’s been done before and someone can leave a comment about it? The problem – when you visit a place you don’t know you have no idea what you need to see, where to get a map, which pubs and cafes are nice, […]