Archives of #Fonts

Extracting keyword text from screencasts with OCR

Last week I played with the Optical Character Recognition system tesseract applied to video data. The goal – extract keywords from the video frames so Google has useful text to index. I chose to work with ShowMeDo‘s screencasts as many show programming in action – there’s great keyword information in these videos that can be […]