Algorithm that predicts what topics will trend on Twitter

Indian origin researchers have come up with a new algorithm that predicts which Twitter topics will trend hours in advance and offers a new technique for analyzing data that fluctuate over time.

By:ANI
| Updated on: Nov 02 2012, 17:17 IST

image caption — Social-networking-site-Twitter-Shutterstock-Image

Indian origin researchers have come up with a new algorithm that predicts which Twitter topics will trend hours in advance and offers a new technique for analyzing data that fluctuate over time.

Twitter's home page features a regularly updated list of topics that are 'trending' - meaning that tweets about them have suddenly exploded in volume.

You may be interested in

MobilesTablets Laptops

7% OFF

28% OFF

A position on the list is highly coveted as a source of free publicity, but the selection of topics is automatic, based on a proprietary algorithm that factors in both the number of tweets and recent increases in that number.

Also read

Looking for a smartphone? To check mobile finder click here.

At the Interdisciplinary Workshop on Information and Decision in Social Networks at MIT in November, Associate Professor Devavrat Shah and his student, Stanislav Nikolov, will present a new algorithm that can, with 95 percent accuracy, predict which topics will trend an average of an hour and a half before Twitter's algorithm puts them on the list - and sometimes as much as four or five hours before.

The algorithm could be of great interest to Twitter, which could charge a premium for ads linked to popular topics, but it also represents a new approach to statistical analysis that could, in theory, apply to any quantity that varies over time: the duration of a bus ride, ticket sales for films, maybe even stock prices.

Like all machine-learning algorithms, Shah and Nikolov's needs to be 'trained': it combs through data in a sample set - in this case, data about topics that previously did and did not trend - and tries to find meaningful patterns.

What distinguishes it is that it's nonparametric, meaning that it makes no assumptions about the shape of patterns.

In the standard approach to machine learning, Shah explains, researchers would posit a 'model' - a general hypothesis about the shape of the pattern whose specifics need to be inferred.

'You'd say, 'Series of trending things ... remain small for some time and then there is a step,'' Shah said.

'This is a very simplistic model. Now, based on the data, you try to train for when the jump happens, and how much of a jump happens.

'The problem with this is, I don't know that things that trend have a step function.

'There are a thousand things that could happen,' he said.

So instead, he says, he and Nikolov 'just let the data decide.'

In particular, their algorithm compares changes over time in the number of tweets about each new topic to the changes over time of every sample in the training set.

Samples whose statistics resemble those of the new topic are given more weight in predicting whether the new topic will trend or not. In effect, Shah explains, each sample 'votes' on whether the new topic will trend, but some samples' votes count more than others'.

The weighted votes are then combined, giving a probabilistic estimate of the likelihood that the new topic will trend.

In Shah and Nikolov's experiments, the training set consisted of data on 200 Twitter topics that did trend and 200 that didn't. In real time, they set their algorithm loose on live tweets, predicting trending with 95 percent accuracy and a 4 percent false-positive rate.

Shah predicts, however, that the system's accuracy will improve as the size of the training set increases. 'The training sets are very small,' he says, 'but we still get strong results.'

Of course, the larger the training set, the greater the computational cost of executing Shah and Nikolov's algorithm. Indeed, Shah says, curbing computational complexity is the reason that machine-learning algorithms typically employ parametric models in the first place.

'Our computation scales proportionately with the data,' Shah said.

But on the Web, he adds, computational resources scale with the data, too: As Facebook or Google add customers, they also add servers. So his and Nikolov's algorithm is designed so that its execution can be split up among separate machines.

'It is perfectly suited to the modern computational framework,' he said.

In principle, Shah says, the new algorithm could be applied to any sequence of measurements performed at regular intervals. But the correlation between historical data and future events may not always be as clear cut as in the case of Twitter posts.

Filtering out all the noise in the historical data might require such enormous training sets that the problem becomes computationally intractable even for a massively distributed program. But if the right subset of training data can be identified, Shah says, 'It will work.'

'People go to social-media sites to find out what's happening now,' Ashish Goel from Stanford University and a member of Twitter's technical advisory board said.

'So in that sense, speeding up the process is something that is very useful,' Goel added.

Catch all the Latest Tech News, Mobile News, Laptop News, Gaming news, Wearables News , How To News, also keep up with us on Whatsapp channel,Twitter, Facebook, Google News, and Instagram. For our latest videos, subscribe to our YouTube channel.

First Published Date: 02 Nov, 17:13 IST

NEXT ARTICLE BEGINS

Best Deals For You

Air purifiers to buy in India for healthy and clean air- Here are top 5 picks

Trending Gadgets

Mobiles Laptops Tablets

Algorithm that predicts what topics will trend on Twitter

Indian origin researchers have come up with a new algorithm that predicts which Twitter topics will trend hours in advance and offers a new technique for analyzing data that fluctuate over time.

You may be interested in

Tips & Tricks

iPhone 16 series, OnePlus 13, and other 5 flagship smartphones to launch in 2024

Apple Music can now play ‘same’ playlist on YouTube Music: Here’s how it is possible

iPhone users will be able to transcribe voice recordings with iOS 18: Here is how it works

Protect your Aadhaar Card: How to check, lock, and report misuse effectively online

Wondering if your iPhone has hidden apps? Know how to find and manage them easily

Editor’s Pick

Trending Stories

iPhone SE 4 launch still months away, powerful mid-ranger likely to arrive in…

Apple’s ‘Glowtime’ Event on 9 September: These products, including iPhone SE 4, are not expected to launch

iPhone 16 Pro must improve in these 3 areas—And I say this after using iPhone 15 Pro for almost a year

Aadhaar Card Update for free online: Act before September 14 to avoid future fees

Anil Kapoor featured in TIME's 100 Most Influential People in AI cover, but Sam Altman misses out: Here’s why

Gaming

GTA 6 launch: 4 underrated features that Rockstar shouldn’t overlook

GTA 6 leaked release date stirs speculation among fans: Here’s when it's coming

GTA 6 leaked weather effects leave fans stunned as release date speculation grows amid delay concerns

GTA 6 leak uncovers early Vice City build, showing debug menus, asset tweaks, and variants

GTA 6 release date might have leaked, possibly aligning with a key franchise milestone

Best Deals For You

Air purifiers to buy in India for healthy and clean air- Here are top 5 picks

5 best smartphones for your eyes: Xiaomi 13, Honor 90 to Motorola Edge Plus, check list

Top 10 smartwatch brands: Leading the market with innovation

Japanese toilets in India: TOTO washlet starting price, features and all details to know

Amazon Diwali Sale 2024: Get up to 40% off on ASUS Vivobook S 16 OLED to Lenovo Yoga Slim 6 and more laptops

Trending News

iPhone SE 4 launch still months away, powerful mid-ranger likely to arrive in…

Apple’s ‘Glowtime’ Event on 9 September: These products, including iPhone SE 4, are not expected to launch

iPhone 16 Pro must improve in these 3 areas—And I say this after using iPhone 15 Pro for almost a year

Aadhaar Card Update for free online: Act before September 14 to avoid future fees

Anil Kapoor featured in TIME's 100 Most Influential People in AI cover, but Sam Altman misses out: Here’s why

Trending Gadgets