On Wednesday Bart and I spoke at SkillsMatter to 75 Pythonistas with an Introduction to Data Science using Python. A video of the 4 talks is now online. We covered:
- High Performance Python (profiling, line_profiler, memory_profiler, Cython, Numba)
- Natural Language Processing and Machine Learning (scikit-learn for brand detection) – based on my longer talk at PyConUK a couple of months back
- Solving the Titanic Kaggle competition using an IPython Notebook (with scikit-learn and Pandas) [IPython notebook to follow] – use this github project to get started
- Solving the Future Cities Parking Data Hackathon (Bart Baddeley) – nbviewer for the Notebook and Pandas (based off our earlier write-up)
Since the group is more of a general programming community we wanted to talk at a high level on the various ways that Python can be used for data science, it was lovely to have such a large turn-out and the following pub conversation was much fun.
Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
16 Comments