I haven’t been writing much recently, because I have been very busy getting a couple of StartUp projects off the ground. Building tech from scratch is very different from what I have usually done during my career, which is work with large and mature systems.
It has been a real adventure and we have had lots of twists and turns along the way. We started off rising to a challenge set by the Autorité des marchés financiers (AMF) – the financial regulator of Quebec. They sponsored the FinTech Formathon and gave us a 3-month salaried runway to get the project off the ground at Concordia University’s District 3. The main issue their analysts and researchers have with existing news services is that they don’t do enough to avoid serving up the same story – albeit often in a slightly different form – over and over again. For a busy researcher, reading the same thing twice is very annoying – a problem that “churnalism” or “copy paste” journalism has exacerbated. Our first prototype for the AMF was a news search engine that clustered similar articles together, to help researchers identify when they already knew enough about a story.
We went on to develop a more sophisticated prototype, with a more interesting UI, and we are now building out our MVP which will use some machine learning techniques to improve relevancy and article similarity detection. We will work closely with the AMF to curate sources particularly relevant to them.
News services for businesses
We have also built a news search product that can be tailored to any subject area and is quick and easy to use. This will appeal particularly to organizations who want a lightweight straightforward way to keep up with news trends and hot topics.
Instant news – The Daily Snap
We then realised that ordinary people want to keep up with the news and are just as frustrated as the professional researchers. People want to know what is going on without spending too much time reading, but the level of trust in social media to provide quality news has plummeted. The problem with social media as a news source is twofold – on the one hand, free-to-use services need advertising revenue, and so what you see is ultimately what the “advertisers” want you to see. (In old media days, advertisers were usually large retailers and corporations because TV, radio, and print media buying was a convoluted process. Now anyone from anywhere in the world can pay a social media company to promote anything at all – even if they are Macedonian teenagers).
The second problem is that social media is a huge time suck. You might just want to glance at the headlines, but once you open up your social media app, it is almost impossible not to spend longer wandering around than you intended. No one wants to be left out when everyone else is talking about a hot news story, but no one wants to lose hours of their life to trivia either. This is why we created The Daily Snap. It is “instant news” – five headlines in your email inbox, so you can keep up with what’s going on in as little time as possible.
The Daily Snap will help us understand how people interact with their daily news and will help us develop our main product – a personalized, ad free, data secure, privacy respecting, high quality news service.
It has been a lot of fun diving into dataset classification for machine learning. My taxonomy skills have certainly proved extremely useful in helping us categorize articles and I will write more about the semantic aspects of our technology and our fantastic team in a future post.