Entrepreneurial Geekiness

Ian is a London-based independent Chief Data Scientist who coaches teams, teaches and creates data products. More about Ian here.
Entrepreneurial Geekiness
Ian is a London-based independent Chief Data Scientist who coaches teams, teaches and creates data products.
Coaching
Training
Jobs
Products
Consulting

“On the Delivery of Data Science Projects” – talk at PyDataCambridge meetup

A few weeks I got to speak at PyDataCambridge (thanks for having me!), slides are here for “On The Delivery of Data Science Projects“.

This talk is based on my experiences coaching teams (whilst building IP for clients) to help them derisk, design and deliver working data science products. This talk is really in two halves – it takes the important lessons from my two training classes and boils them down into a 30 minute talk. We cover:

  • What makes for a successful data science project?
  • Developing a Project Specification for shared agreement including a Definition of Done
  • Using standard tools and processes to standardize and simplify
  • Ideas around best practice

Let me know if you found this talk useful? I really think the ideas around successful project delivery need to be collected and shared, we’re still in the “wild west” and I’m keen to collate more examples of successful process.

 


Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
Read More

Thoughts on how to start a PyData or Python meetup

At PyConLT 2019 (Lithuania) we just had a 10-person meeting on “how to start a new PyData or Python meetup” with existing organisers and some potential new event organisers. The night before in the conference bar Radovan and I had spent an hour helping someone from Latvia figure out their plan to start a new Python meetup. Given that I’m a co-founder for PyDataLondon and after 6 years we’re at 9,500+ members, I have some opinions. Maybe sharing these will help others. All going well we’ll see a new PyDataVilnius start with what looks like a 7+ person volunteer group, all organised at PyConLT.

I’m pretty sure that the key to starting a meetup is to have:

  • lots of marketing (I can’t stress this enough – to kick start a new event you have to help relevant people find you)
  • regular meetups (e.g. monthly or e.g. every 3 months – whatever fits your community and your level of engagement)
  • high quality speakers
  • the same venue every month (it is easier for attendees and much easier for the organisers to have a fixed venue)

None of the above are “magic”, they each help. Regular meetups mean people put them into the calendar. High quality speakers mean people get used to turning up because “they’ll either learn something or meet other interesting attendees” – I’m really proud of the quality of speakers we’ve had consistently over 6 years at the PyDataLondon meetups. The same venue means there’s no confusion about how to find you for the attendees and there’s no extra effort for the organisers to have to keep finding and negotiating new venues.

For the first few months keep your life simple – get setup on meetup.com (if you’re doing a PyData you can email NumFOCUS to ask for an official PyData branch to be created) or have a landing page (/blog/mailing list/facebook page). Find a couple of good speakers – ask around, you’ll quickly be surprised at how good your existing network already is. Choose a regular date (PyDataLondon is “the first Tuesday of the month” and we’ve rarely broken that rule). Find yourself a couple of co-organisers – 3 co-organisers is great, 1 is too much of a burden for 1 person I think.

Don’t stress too much about the first venue. You’ll probably grow out of it so you just need a stable base to start from. I’ve run prior meetups out of pub function rooms, renovated Victorian townhouses and co-working spaces. PyDataLondon started at Pivotal (thanks Ian!), then moved to Lyst (thanks Steve!) and then for 3+ years at Man AHL (super thanks Slavi, Gary and colleagues!).

Next – publicise it. Tell all of your friends who’d be even vaguely interested and ask them to tell three friends. Post to Facebook, Twitter, your favourite forums/slack/whatsapp groups. Ask friends in companies or at universities to advertise internally. Go to allied meetups that are already running and ask the organisers if you could have 2 minutes on-stage to announce to their audience (and of course offer to publicise their event to your audience in the future – always reciprocate).

If you’re stuck for speakers – look at the history of speakers at other related meetups and conferences, then reach out to them. Tell your friends that you’d like to reach persons Y and X and maybe they can do you intros.

At each meetup remind everyone that you’re a volunteer groups – so many attendees don’t realise that the organisers & speakers aren’t paid to do this. Remind them that you’re all volunteers giving up your time, ask them to thank the speakers and your colleagues (and maybe to buy them beer or coffee). At the end of the event get your volunteers and speakers to stand and ask everyone to thank them (and maybe buy them a beer in the pub after) – make sure your volunteers are acknowledged. Also remind everyone “we want new speakers – please come and have a chat with the volunteers!” and you’ll develop a feed of people willing to speak for 0 extra effort on your part.

At PyDataLondon we get lots of first-time attendees now (maybe 30% of the room are first-timers now). I’ve taken inspiration from other meetups (e.g. LinuxingInLondon and others) to make the start ‘more friendly’. We have 200 people in the room, many of whom don’t know who they sit next to. We do a 1 minute intro section where “you turn to your neighbour and ask ‘What do you do with Python?’ and then they reciprocate” – so then everyone has someone to talk to in the break. I also get and pass aroud “green newbie badges” – these are stickers that any newbie can stick on themselves, this gives them the super power to go and join any conversation in the breaks. I also get stickers for the speakers & volunteers (robots, unicorns, planets – the London Science Museum is great for these). These techniques work nicely.

Think about your motivations for running a meetup. It takes a lot of sustained effort to make it consistent and then it just takes effort to keep it going. You’ll want to grow an organising group to keep it maintainable. Ask yourself – why do you and your colleagues want to run this group? Make sure you’re all in alignment about your core values, else you may be ignoring issues which might make things weird later. Thankfully the groups I’ve been involved in haven’t “gone weird” but I’ve seen it happen (e.g. founders who won’t let go of the reigns and then burn out), you can sensibly avoid some of these problems by having frank discussions up front.

Later on you may need to scale up your organising group. At PyDataLondon we’ve been blessed with a super feed of lovely volunteers – we run a committee with circa 13 organisers. My earlier meetup “fivepoundapps” was an awful lot of fun but my friend John and I realised after some years that nobody else wanted to run the event – no amount of cake baking (John) and beer carrying (me) would change the feel of the event from “John and Ian’s fivepoundapp” which was a pity. When we burned out it just died. Happy memories, but a pity that it didn’t sustain.

For PyDataLondon I put thought into sustainability and our organisers discussed this as we grew. In the background we have a rotating Release Manager (they check with the venue & speakers, send out a “please unRSVP if you can’t attend” email and do some housekeeping – essential and mostly invisible work), several of us look for new speakers, we used to do a lot of publicity but thankfully that’s mostly automatic now. I write an “update email” most months to our 9k+ members, that takes 1-2 hours to write and includes some links about upcoming relevant events and often requests new speakers. All the volunteers rotate to be on stage, not everyone is comfortable with that (and sadly some of us – myself, Emlyn, Marco – are more comfortable with it than others and we get he limelight) but we try to make sure that all volunteers get some visibility. I don’t know a better way to do this, I’m open to feedback.

You don’t have to start big, you just have to be consistent and it’ll grow sensibly over time. We’re 6 years in now with PyDataLondon and it just seems to keep growing. Build a nice small meetup, let it grow, delegate and find colleagues who want to help it grow and…let it. Beer generally helps (+wine+soft drinks+pizza+suitable alternative food – suit the needs of your audience). A code of conduct is very sensible (NumFOCUS sensibly insist on this for PyData events). Keep telling people. Find high quality speakers. Rinse and repeat. Enjoy the new community. Sometimes I joke that the reason I’ve put in 6 years of effort is, on a personal level, because I couldn’t find 50 interesting folk to come to the pub with me – now I get it for free every month. That’s magic.

Thanks for Radovan of PyDataBratislava, Jan of PyDataPrague, Aidis of PyConLT for direct feedback yesterday, to my colleagues at PyDataLondon and global PyData events and to others who have helped me figure out these approaches over the years.

ps. written at the airport – please excuse any typos!


Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
Read More

PyCon Lithuania 2019 and a keynote on “Citizen Science with Python”

I’ve had the great pleasure of attending PyConLT 2019 – my first trip to Lithuania. I had no idea what to expect (I’ve never been to this part of Europe) – Vilnius is a lovely city full of lovely Pythonistas. There’s a bunch of lovely art hanging underneath bridges, an amazing Soviet Palace of Arts and Sports and a number of castles – it really is very lovely here.

This keynote talk builds on a couple of previous iterations, here I spoke on “Citizen Science with Python“:

  • Dirty North-Macedonian air analysis by Gorjan (from PyDataAmsterdam)
  • Improving child births via Anna (from PyDataWarsaw)
  • Monitoring saved and relocated Orangutangs using SciPy and drones via Dirk (from PyDataAmsterdam)
  • Improving political engagement in the run up to the UK EU elections (via myself and colleagues at a political hackathon)
  • Short demo of using open data in a Notebook to visualise  the 30 year increase in new breweries in the UK

The keynote was aimed at a mostly-engineer audience, to show that “with the power of the keyboard” we all have skills to make the world a better place that most of our fellow citizens lack.

Whilst here I got the chance to get involved in two very interesting discussions:

  • How can others who want to start a meetup get some advice? This (hopefully) leads to the formation of a new PyDataVilnius (it was announced on stage in the closing notes – yay!) and the expansion of a similar group back in Cyprus (which has already had some positive feedback – yay!)
  • How can folk take the next step with their data science project which led to interesting chats over lunch

I spent most of my time on the PyData track and met a load of very smart local and international folk, organiser Aidis and his volunteer crew did a really good job of pulling this together. I also got to meet PyDataBratislava co-organiser Radovan and PyDataPrague co-organiser Jan (we had the pleasure to meet last year when I went to their first meeting). These events really do help to build such a great international network.

 


Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
Read More

Software Engineering for Data Scientists and Successfully Delivering Data Science Projects – 2 courses for May

In May I’m running 2 courses:

The first is aimed at data scientists who have had a bad or wobbly delivery who want to learn better ways to design projects, derisk their stages and deliver more reliably. After the course you’ll have a guide to writing sensible project specifications, a new process to follow, new tools to help your team write stronger data science solutions and you’ll have had Q&A time to get your own questions answered.

The second is aimed at data scientists who lack software engineering experience, who need practice in writing stronger code, documentation, readability, collaborative practices, testing and trustworthiness of their code to be able to deliver useful production-ready systems. After the course you’ll write stronger code, you’ll be able to supportively critique the code of your colleagues and you’ll be able to have stronger conversations with the engineering team at your organisation.

Both courses use a slack channel which is live after the training, past students are still asking questions and getting answers from other students – by attending you’ll have access to this resource and we have healthy Q&A time in the courses to get all of your uncertainties resolved so you can take answers back to your office.

“One of the highlights from Ian’s Successfully Delivering Data Science Projects course was being introduced to the concept of a specialised project specification document. This provides a systematic framework to directly tackle numerous problems I have experienced when trying to move a project beyond an initial prototyping stage. I have now applied my own tailored specification document at my organisation and it immediately surfaced critical questions and issues that otherwise would not have been realised for months.” – Thomas Brown, Data Scientist at aire.io

Both are linked on my training page, there’s an email list there where I first announce new courses.


Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
Read More

Second Successfully Delivering Data Science Projects just over

I ran the second iteration of my Successfully Delivering Data Science Projects course last Friday to this happy group, we had a lovely day and good conversation has continued in the teaching slack over the weekend:

Topics covered included the design and derisking of data projects (not just machine learning), building a project plan, communicating effectively with non-data science stakeholders, cost/benefits analysis, tooling, hiring and the process of actually getting R&D models shipped and supported. The conversation was both similar and nicely different to the Q&A topics that came up in the previous iteration.

I haven’t got a date yet for the next iteration, it’ll be in a couple of months. I may also run something on Software Engineer for Data Scientists and another on Higher Performance Python, possibly I’ll run one on NLP with a colleague. You can sign-up for notifications on my very low-volume mailing list if you’d like to hear when courses become available.

I’m doing research into which other topics might be most pressing in London – I’d be curious to know what you’d like to learn in the next 6 months if a course were run in London. The survey attached has 1 question with 1 choice and there’s no sign-up. There is a second question box to take your email (entirely optional) which will get you a discount code valid for the year.


Ian is a Chief Interim Data Scientist via his Mor Consulting. Sign-up for Data Science tutorials in London and to hear about his data science thoughts and jobs. He lives in London, is walked by his high energy Springer Spaniel and is a consumer of fine coffees.
Read More