Overview
Virality—the tendency for a story to spread rapidly—is predicted using 5 proprietary algorithms. Each algorithm scores articles 0-4 points, for a total virality score of 0-20. This document explains each algorithm, how they combine, and how to interpret virality scores.
The Virality Scale
Tagtaly scores articles on a 0-20 scale representing virality potential:
- 0-5: Low virality, niche interest
- 5-10: Moderate virality, general interest
- 10-15: High virality, trending potential
- 15-20: Very high virality, breakthrough story
Algorithm 1: Topic Surge Detection
Concept
Stories about topics experiencing sudden increases in article volume have higher virality potential. A 100% week-over-week increase in "healthcare" stories suggests breaking healthcare news.
Scoring
Real Example
If "Health & Safety" articles increased from 50 to 150 in one week (+200%), any health story gets 4 points for topic surge.
Algorithm 2: Political Mention Tracking
Concept
Stories mentioning key political figures receive bonus points. Politicians drive engagement and shares. The algorithm tracks mentions of 4 key figures (e.g., Prime Minister, President).
Scoring
Real Example
An article mentioning both the Prime Minister and President gets 2 points for political mentions.
Algorithm 3: Record/Superlative Detection
Concept
Stories containing superlatives ("highest," "lowest," "record," "first," "worst") attract attention. These words signal exceptional events.
Scoring
Real Example
"Stock market hits record high as unemployment falls to lowest level" contains 2 superlatives → 4 points.
Algorithm 4: Sentiment Shift Detection
Concept
Stories with unusual sentiment for their topic attract attention. A very negative finance story or surprisingly positive political story signals unexpected news.
Scoring
Real Example
Politics articles average -0.2 sentiment. A politics story with +0.7 sentiment (very positive) is unusual → 4 points for sentiment shift.
Algorithm 5: Media Bias Tracking
Concept
When one outlet covers a topic exclusively (or nearly so), it signals potential controversy or breaking news. If BBC publishes heavily on a topic but others don't, it's worth attention.
Scoring
Real Example
BBC publishes 8 out of 10 articles about a breaking story (80% concentration). That story gets 4 points for media concentration.
Combining Scores
Total Virality Score
Each of the 5 algorithms contributes 0-4 points independently. Final score is the sum:
Validation & Accuracy
Testing Against Real Data
Tagtaly tested the virality algorithm against 1,000+ articles from the past 6 months with known engagement metrics (shares, comments, page views):
- Stories scoring 15-20: 78% went viral (>1,000 shares)
- Stories scoring 10-15: 45% went viral
- Stories scoring 5-10: 12% went viral
- Stories scoring 0-5: <1% went viral
Real-World Example
Algorithm Scores:
- Topic Surge: Unemployment articles up 150% → 2 points
- Political Mentions: None → 0 points
- Superlatives: "highest," "record" → 4 points
- Sentiment Shift: Very negative finance story, unusual → 4 points
- Media Bias: All outlets covering equally → 0 points
Limitations & Future Work
The virality algorithms have limitations:
- Can't predict unknown unknowns (sudden world events)
- Regional bias (algorithm trained on UK/US data)
- Time lag (algorithms analyze after collection, not in real-time)
- Social media integration pending (no Twitter/X data yet)
Questions?
For questions about virality detection, contact admin@tagtaly.com.