A piece of advice for news sites operators: invest money in a real recommendation engine, tag-based, social, or even semantic filtering. Readers will stay longer on your site, increasing the value of their visits. On average, major news sites don’t get more than 3 to 4 pages per visit. Sadly, those who manage to go well above those numbers rely on lame tricks such as increasing the page auto-refresh rate or stuffing their site with click-intensive items: online games or slide shows.
This is not an effective tactic. When you artificially increase the page view rate, your visitor spends less time on each page. With less time spent on a given page, the result is lower click-rates on advertising banners, less revenue. A much better tactic is increasing the “real” number of page views per visit, page views resulting from news content actually relevant to the visitor. As a result, ads will be presented in a much more favorable context or, alternatively, content will be more likely to be paid for.
Don’t tell me what, tell me how… Simple answer: deploy recommendation engines comparable to what exists, already, successfully, on e-commerce or entertainment sites.
Take Amazon. Perhaps the most successful example. I left many bread crumbs on Amazon, especially on the US and UK sites. Then, Amazon sends notifications, offers by email — among many similar ones from other online store I did business with. I systematically open Amazon’s email simply because I know they are the most relevant to my centers of interest. If I searched for books on architecture six months ago, Amazon will notify me when a new biography of Rem Koolhaas is released. When I visit the site, every time, the page makes suggestions based on my previous visits, displays catalog updates and comparable purchases from other customers. And if I put an item in my cart, Amazon proposes related bundles.
Another example: the music site Deezer.com. Starting with a single entry in the search engine, simply by bouncing from one artist to the other based on other people’s taste, I’ve been able to unearth artists I was fond of twenty years ago. This is what I call efficient recommendation engines. Pleasant and profitable.
Hence the question: Why are online newspapers unable to provide a similar experience? Let’s look at a real situation. This week, for my news consumption, I’ve been following items such as the painful developments of the stimulus package in the United States ; in France, the bailout of the car industry and the social unrest in the former French colonies ; in space, the collision between an old Russian satellite and an a American one. In most cases, I got no more than a lukewarm set of “related stories” that have been dug up and inserted manually, most of the time weakly relevant, unexciting.
Fact is: this system never gets close to taking advantage of the editorial depth of news sites adding dozens, if not hundreds of items per day. Digging deeper into a topic involves manually searching through the site or jumping elsewhere for further research. (Since I often end up in Wikipedia, I donated €30 hoping its crew will keep caring for its quality and usefulness).
A recommendation engine ought to be able to encode my interest in a subject into a vector. (OK, a list of criteria, dimensions, and numbers in front of each.) Did I just quickly scan the story? Did I spend six minutes reading it? Did I print it for later reading (that’s an interesting piece of information, easy to gather)? Did I send the story to a friend? Did I share it on Facebook? Did I click on one, or two related links? Did I bookmarked the URL for later retrieval? Did I look for the same topic just once or three times in the week? There are many ways to automatically measure my interest in a subject.
From such measurement, serving relevant material extracted from the archives isn’t rocket science and certainly not artificial intelligence: a box in the article page refreshes and offers relevant links (instead of serving me a new ad I don’t care for). Or, if I go back to the same subject at a later time, it adds suggested articles or multimedia items I missed — exactly as Amazon proposes new books or DVDs.
Coming back to this week’s hot topics, I could have had an in-depth story on US Treasury Secretary Tim Geithner’s management methods, a two-year old story in Le Monde reporting on the racial divide in the French Indies, or a feature on how the US Space Command saved the life of astronauts by preventing a collision with orbiting junk ten years ago. Without a doubt such technique would have increased my “stickiness” to my preferred sites and therefore the value of my visits.
“Off-the-shelf” technology is abundant. There are three categories of recommendation engines: Active, Social and Passive filtering.
Forget about the first one, the Active one: contrary to the “Daily Me” myth, no one will spend fifteen minutes stating his/her centers of interest by filling a form or checking 20 boxes of news-related preferences. You do that once, maybe, for your online grocery list, but not for a moving target such as news. Cereals are easier to define. The second type, social filtering, or social recommendation, is based on a statistical analysis of other people’s tastes as expressed by their consumption patterns. They liked this, then you might like it as well. It works fine for books, movies or music, not that well for news items, because of the unpredictability of the news cycle.
We’re left with the most applicable engine: Passive Filtering accumulates information from my previous browsing sessions. My navigation patterns draws a profile of my news interest that can be matched with other users, further enhancing the system. The older and thicker my history is, the better the engine will perform. It will work fine as I’m loyal to my trusted brands, whether it is Le Monde, 20 minutes, VGnett, or the New York Times. This leads to a virtuous circle that can also encompass the news outlets beyond my original trusted brand that it would aggregate for me. A kind of self-generated, reliable web ring.
There are technical hurdles. The most obvious is the complexity of the corpus. Thousands of articles covering a vast array of topics are not easy to categorize, as compared to a catalog of cultural goods in which an artist or an author can be connected to a genre. It is the opposite of a well-structured database. A headline won’t necessary help much from a retrieval capability standpoint. The essential notion of “tag” for a story is underdeveloped. The reason for such shortcoming is simple: journalists and editor are not taught to properly tag and categorize their stories. (The funny thing is that superblogs such as Henry Blodget’s Alley Insider, using a simple tag system, are much better than big news sites at encouraging the reader to stay on the site.)
Technical challenges are no excuse for doing nothing. The real hurdle is a cultural one. In many online newspapers I know of, the Internet culture is recent, almost a foreign object. What mostly occupies minds there is “transition” thinking, from print to the Internet. Technical questions are downgraded as secondary, in many instances left to the techies.
This is a mistaken way of looking at the problem. Yahoo!, for one, eBay, for another, are companies that failed to keep technology as a top-level, strategic concern, putting it down as the infamous “mere matter of implementation”. As a result, the Google and Amazon geeks ate their lunch. It is time to consider a merger between editorial and technical functions, to put both at the same top level. The result will be more valuable than the sum of the parts. —FF
Related columns:
- The tragic economics of ultra-small news sites Two days before heading the Elysée gathering, I had a conversation with the founder of a tiny French news sites called Bakchich. info. The guy’s name is Nicolas Beau. He has a hell of a track record in investigative reporting. He spent quite a while at le Canard Enchained, a satirical newspaper-like weekly known for [...]...
- Opening the News Warning: religious debate here. Should a news web site be open or closed, free or paid-for? There is no simple answer, of course, as hybrid models are a likely part of our future. But, first, let’s review the paid-for model I addressed in previous issues of the Monday Note as well as in the French [...]...
- News websites: More bucks for the click Here’s the tragedy of our business model: Swelling demand, shrinking revenue. News is much in demand in these turbulent days. The number of viewers is on the rise. And, at the same time, the money we manage to extract from these swelling audiences is shrinking inexorably. We are about to pay the price — so [...]...
- Top 30 Most Popular Newspaper Sites Now let’s have a look at the raw data, i.e. the US newspaper’s performance in January. It remains impressive. On the table below, the first number is the audience in unique visitors for the month, the second is the year-over-year change in % : NYTimes.com 20,461 — [...]...
- Paid news on Mobile. Why it could fly. This week, I downloaded the iPhone application of the British newspaper the Independent. It’s a new breed of app, taking advantage of the new features embedded in the third iteration of the iPhone OS. For a daily newsmedia, Push Notification is the most interesting new feature, combined, in this case, with an offline reader. On [...]...






8 Comments
Let’s start with your article ! I see no tag around it. The reason is : it’s a real job to ‘tag’ stuff like article or news.
Like you, editors need to hire new profiles instead of having more and more journalists. They should have a ‘library’ department with professionnal archivists among other techies. Editors should understand that Internet is not only a ‘realtime’ media but over all a (The) big library. Editors should understand that their content need non-journalist work on it to raise their value. Stan.
FF,
Although you’re right overall, I totally agree with Stan that recommendation engines require more than good will.
Changing a website from shovelware (even if it’s realtime, full of links and well indexed) into a library does require a shift in perspective. The value proposition has to change: A media isn’t there to add new stories every day; rather, it should organize all the content of its niche (while updating it in real time, that goes without saying).
Changing opinions is necessary but insufficient. Hiring a tagger would costs around 25k€ a year (or consider this expense the time spent by j’lists tagging). Add development costs and the bill easily reaches 30,000€. Who can afford that today?
Do you have data on how much Amazon spends on its recommendation engine, for instance?
Tagging is not the only way to make a recommendation engine work properly, you can use logs of IP and URL from servers to make a “People who read this article also read…” fonction.
However, there is one rule : The more datas the better the recommandation is. Corollary: It’s a bit difficult to make such algorythms work on a single blog.
PS: happy that you open your blog to comments!
Hi Fréd, great article and so true, take a look at the site Lepoint.fr, we implemented a tool (antidot) that gives readers (automatically) links to other articles - no tagging, it’s an algorythm. Of course we can always do more, but this is a good start, and economically viable !
Aimée
ps and as nicolas says, all of this does take some kind of investment - either human or technological - and the hardest part (as you know, I know) is figuring out how to pay for it !
Frédéric , we’re not exactly to your spec, but we’re pretty close with a passive filtering engine that’s been running for a year on our Dallas, TX-based site and that now has an integrated ad server:
http://www.pegasusnews.com/about/dailyyou/
http://www.pegasusnews.com/blogs/pegasusnewsblog/2009/feb/02/adserver/
It’s still far from perfect, but it’s a lot of fun to be able to give contest prizes to readers who haven’t even entered, simply based on what we know they like:
http://www.pegasusnews.com/blogs/pegasusnewsblog/2008/feb/05/uncontest/
Thank you for a very interesting article! I think you are right in pointing out that news sites should use recommendation engines. Far too long news organizations have assumed - wrongly, in my opinion - that we are very competent in structuring information. We are not. Instead news organizations typically only organize their material along two dimensions: Date of publication (recent is big) and which main section of the site/paper content belongs to. All other meta information - like geo coordinates, tags, etc - we have been poor at adding.
But how to do this now? Are there any automatic recommendation engines out there which would work perfect for news sites? Anyone has suggestions?
My original comment appears stuck in moderation (probably because of links), but we’ve got a live passive recommendation engine on a Dallas, TX news site that does much of what you describe here. We use it to deliver individually customized news and advertising.
The vector you describe is the missing piece– some good ideas here that we’ll look at incorporating.
Recommendation engines are important for news sites. Period.
You site different examples of implicit recommendation in your post, all of which make sense and will resonate with different groups of users. The Amazon approach is actually a catch-all for passive recommendation since it combines a number of means to push additional items to users based on:
your prior purchases
your prior searches
others who have purchased items similar to you
your explicit wishlist
… and so on
The issue for most news organizations comes down to bandwidth issues - not having enough resources internally to build and support a system that can read peripheral metadata about their content. It is a challenge they face. However, implementing such a system is a necessity in order to create/increase user stickiness as well as loyalty. Users don’t want to invest their time in discovering additional news, if it is not in front of them, they will likely leave your site and revert back to a search engine as a means of new discovery. (Something we like to think does not happen anymore, but does in large numbers.)
I have seen a number of good apps out there, some open-source, others require payment, but all will require a point person internally to program, QA, maintain, etc. In the end, it comes down to a business decision… prioritizing what you think will get you to where you ultimately want to be.
4 Trackbacks
[...] Otagowane Marketing internetowy, Nowe trendy, serwisy internetowe, treści No właśnie. Dlaczego nikt nie robi rekomendacji wiadomości i treści na podobnej zasadzie, jak Amazon? Monday Note rozbija temat na drobne. [...]
[...] value from the “long tail” of archives, either through recommendation engines (see Monday Note #73) or through well disseminated [...]
[...] than we should not develop better recommendation engines: as I explained in an earlier article (see Recommendation Engines: A Must for News Sites ), I think recommendation engines hold great potential to increase the number of pageviews for the [...]
[...] few months ago, I mentioned the importance of recommendation engines comparable to the ones used by e-commerce sites. These engines can be algorithm-based — [...]