UNIT 2: SOCIAL NETWORK ANALYSIS • LESSON 4

Bot Detection & Network Manipulation

Learn to identify bots, detect sockpuppets, spot astroturfing campaigns, and distinguish legitimate automation from malicious manipulation.

⏸️ Reflection Pause

Trust in the Age of Deception

Not everything is as it appears online. Bots masquerade as humans. Sockpuppets pretend to be grassroots movements. Manipulation networks undermine trust. When you can't tell who's real, what do you believe?

Questions to Consider:

  • How do you decide what to believe online? What's your process for verification?
  • If 30% of accounts are bots, does that change how you view "viral" content?
  • What's the difference between healthy skepticism and paranoid cynicism?

Bots exploit human cognitive biases:

  • Bandwagon effect: "Everyone agrees, so it must be true"
  • Social proof: "Thousands of retweets = must be valid"
  • Trust heuristics: "If it looks human, it probably is"

Manipulation networks undermine democratic discourse. When you can't trust that the "person" you're talking to is real, civil conversation collapses.

Digital discernment is the literacy of our age. Test everything. Trust wisely.

🎮 Game 1: Bot or Human?

SCENARIO
You're analyzing Twitter accounts discussing a political topic. One account raises red flags.

Is this account likely a BOT or HUMAN?

🎮 Game 2: Detect Sockpuppets

SCENARIO
A controversial Reddit post has 5 supportive comments from different accounts. You notice patterns.

Account Analysis:

@TechGuru2024: "This is brilliant! Finally someone speaking truth!"
Posted: 10:47 AM • Account created: March 15, 2024

@ScienceFan88: "This is brilliant! Finally someone speaking truth!"
Posted: 10:48 AM • Account created: March 15, 2024

@RandomUser_42: "Couldn't agree more with this analysis."
Posted: 10:49 AM • Account created: March 16, 2024

@TruthSeeker99: "This is brilliant! Finally someone speaking truth!"
Posted: 10:50 AM • Account created: March 15, 2024

@OpenMindNow: "Everyone needs to read this important thread."
Posted: 10:51 AM • Account created: March 16, 2024

What pattern suggests sockpuppetry (multiple fake accounts by one person)?
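
Patterns like these can be surfaced programmatically. Below is a minimal Python sketch that checks two of the signals in the data above, duplicate comment text and tightly clustered account-creation dates; the records and field names are illustrative stand-ins, not a real platform API.

```python
from collections import Counter
from datetime import date

# Hypothetical records mirroring the five comments above.
comments = [
    {"user": "@TechGuru2024",  "text": "This is brilliant! Finally someone speaking truth!", "created": date(2024, 3, 15)},
    {"user": "@ScienceFan88",  "text": "This is brilliant! Finally someone speaking truth!", "created": date(2024, 3, 15)},
    {"user": "@RandomUser_42", "text": "Couldn't agree more with this analysis.",            "created": date(2024, 3, 16)},
    {"user": "@TruthSeeker99", "text": "This is brilliant! Finally someone speaking truth!", "created": date(2024, 3, 15)},
    {"user": "@OpenMindNow",   "text": "Everyone needs to read this important thread.",      "created": date(2024, 3, 16)},
]

# Signal 1: identical text posted by "different" accounts.
dupes = {t: n for t, n in Counter(c["text"] for c in comments).items() if n > 1}

# Signal 2: all accounts registered within a narrow window (bulk creation).
dates = [c["created"] for c in comments]
span_days = (max(dates) - min(dates)).days

print("Duplicated comments:", dupes)
print(f"All {len(comments)} accounts created within {span_days + 1} days")
```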

🎮 Game 3: Spot Astroturfing

SCENARIO
A hashtag #SupportBigPharma starts trending. You investigate whether it's grassroots (real people) or astroturf (fake grassroots).

Campaign Analysis:

• 47,000 tweets in 6 hours
• 83% of accounts created in last 30 days
• Identical tweet templates: "I trust #BigPharma to keep us safe! [emoji][emoji]"
• Coordinated posting spike: All accounts tweeted within 15-minute window
• Follower networks: 94% overlap (same accounts follow same accounts)
• Funding source: Not disclosed, but IP analysis shows single data center origin

Is this a REAL grassroots movement or ASTROTURFING?
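
If you want to make this judgment systematically, you can encode the campaign signals above as a simple checklist. The sketch below is illustrative only; the thresholds are assumptions, not validated cutoffs from any real detection system.

```python
# Hypothetical metrics pulled from the campaign analysis above.
campaign = {
    "share_new_accounts": 0.83,    # accounts created in the last 30 days
    "template_reuse": True,        # identical tweet templates
    "posting_window_min": 15,      # coordinated spike window
    "follower_overlap": 0.94,      # same accounts follow same accounts
    "single_ip_origin": True,      # single data-center origin
}

# Illustrative thresholds; a real detector would calibrate these empirically.
flags = {
    "bulk-created accounts": campaign["share_new_accounts"] > 0.50,
    "copy-paste templates":  campaign["template_reuse"],
    "tight posting window":  campaign["posting_window_min"] <= 30,
    "follower overlap":      campaign["follower_overlap"] > 0.80,
    "single infrastructure": campaign["single_ip_origin"],
}
triggered = [name for name, hit in flags.items() if hit]
print(f"{len(triggered)}/{len(flags)} astroturf signals:", triggered)
```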

🎮 Game 4: Find Coordination Networks

SCENARIO
You're analyzing 200 Facebook accounts posting about a political candidate. Which network structure indicates coordinated inauthentic behavior?

Which pattern is STRONGEST evidence of coordination?

🎮 Game 5: Legitimate vs Malicious Bots

SCENARIO
You're a platform moderator deciding whether to ban bots. Not all automation is evil—some bots serve useful purposes.

Which bot should you BAN for violating platform policies?

📚 Bot Detection Techniques

Bot detection is an arms race—as detectors improve, bots evolve. Here are key detection signals:

1. Behavioral Patterns

  • Posting frequency: Humans sleep; bots don't (24/7 activity)
  • Response time: Instant replies suggest automation
  • Content repetition: Copy-paste identical messages
  • Engagement ratio: High post volume, low authentic engagement

2. Network Structure

  • Follower/following ratio: Bots often follow many accounts but have few followers
  • Coordination clusters: Groups posting identical content at the same time
  • Star topology: Many bots following one central account

3. Account Metadata

  • Creation date: New accounts registered in bulk (e.g., 1,000 accounts all created March 15, 2024)
  • Profile completeness: Stock photos, generic bios
  • Username patterns: Generic handles like @Name12345678
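
These signals can be combined into a rough heuristic score. Here's a minimal Python sketch; the account record, field names, and thresholds are all illustrative assumptions, not a production detector.

```python
import re

def bot_signals(account: dict) -> dict:
    """Score an account against the heuristic signals above.

    `account` is a hypothetical record; field names and thresholds
    are illustrative, not calibrated.
    """
    signals = {
        # Humans sleep; near-24/7 activity is suspicious.
        "always_on": account["active_hours_per_day"] > 20,
        # Very high output with little genuine back-and-forth.
        "high_volume": account["posts_per_day"] > 100,
        # Follows many accounts, followed by few.
        "skewed_ratio": account["following"] > 10 * max(account["followers"], 1),
        # Auto-generated handle like Name12345678.
        "generic_handle": bool(re.fullmatch(r"@?[A-Za-z]+\d{5,}", account["username"])),
        # Empty bio suggests a throwaway profile.
        "thin_profile": not account["bio"],
    }
    signals["score"] = sum(signals.values()) / len(signals)
    return signals

print(bot_signals({
    "active_hours_per_day": 24, "posts_per_day": 400,
    "following": 5000, "followers": 12,
    "username": "User84712309", "bio": "",
}))
```

No single signal is conclusive on its own; it's the combination that raises confidence.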

Advanced Techniques:

  • Natural language processing: Detect machine-generated text patterns
  • CAPTCHA challenges: Force account to prove it's human (but bots evolve)
  • Behavioral biometrics: Mouse movements, typing speed, click patterns
  • Graph analysis: Identify bot clusters via community detection
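
As a small illustration of the graph-analysis idea, the sketch below runs community detection (via networkx) on a toy follower graph and inspects each cluster for a star-shaped hub; the accounts and edges are invented, and real pipelines work at far larger scale.

```python
import networkx as nx

# Hypothetical follower graph: a "star" of throwaway accounts around one
# hub, next to an organically interconnected group.
G = nx.Graph()
G.add_edges_from([
    ("bot1", "hub"), ("bot2", "hub"), ("bot3", "hub"), ("bot4", "hub"),
    ("alice", "bob"), ("bob", "carol"), ("carol", "alice"),
])

# Community detection splits the graph into clusters; a cluster whose
# edges all point at a single high-degree node is a star-topology red flag.
for community in nx.community.greedy_modularity_communities(G):
    sub = G.subgraph(community)
    hub, degree = max(sub.degree(), key=lambda pair: pair[1])
    print(sorted(community), "-> hub:", hub, f"(degree {degree})")
```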

📚 The Bot Detection Arms Race

Why is bot detection so hard? Because bots evolve to evade detection.

Evolution of Bot Sophistication

  • Early 2010s: Simple bots (easy to detect via posting frequency)
  • 2015: Bots add random delays (mimicking human sleep patterns)
  • 2018: GPT-2 enables realistic text generation
  • 2020: Bots use stolen profile photos (harder to spot fakes)
  • 2023: LLM-powered bots (ChatGPT-level conversation)
  • 2024+: Multimodal bots (generating images, videos, voices)

Legitimate Automation vs Manipulation:

✅ LEGITIMATE BOTS (should be allowed):

  • News aggregators: @NYTimesBot auto-posts new articles
  • Customer service: @SupportBot answers FAQs 24/7
  • Accessibility tools: Image description bots for blind users
  • Fact-checkers: @FactCheckBot flags false claims
  • Disclosure: All clearly labeled as bots

❌ MALICIOUS BOTS (should be banned):

  • Propaganda bots: Amplify political messages deceptively
  • Spam bots: Flood platforms with ads, scams
  • Sockpuppet networks: Fake grassroots support
  • Harassment bots: Coordinated attacks on individuals
  • Deception: Pretend to be human

Platform Policy Dilemma: Ban all bots (lose useful automation) or allow bots (enable manipulation)? Most platforms choose a middle path: Require disclosure + ban deceptive behavior.

📚 Beyond Bots: Computational Propaganda

You've learned about bot detection—identifying fake accounts. But modern information warfare goes far beyond simple bots. Welcome to computational propaganda.

Computational Propaganda = Using algorithms, automation, and big data to manipulate public opinion

This includes:

  • State actor networks: Government-backed coordination campaigns
  • Dual-sided warfare: Both sides of a conflict running coordinated campaigns
  • Cross-platform coordination: Synchronized amplification across Twitter, Facebook, TikTok
  • Sophisticated tactics: Language manipulation, temporal patterns, engagement asymmetry

Real-World Example: Ukraine Dam Crisis (AgentAcademy Study)

Case: Kakhovka Dam Breach (June 2023)

Three AI models analyzed 266,000 tweets about the Ukrainian dam disaster. All three independently found the same unexpected pattern:

🇨🇺 Cuban state media (@PartidoPCC, @DiazCanelB) were among the biggest amplifiers.

Why is Cuba interested in a Ukrainian dam? This is coordinated state propaganda:
• 72% of posts were retweets (amplification, not discourse)
• 1,390 accounts created the week Russia invaded—then dormant until this crisis
• Off-topic injection: Biden/Burisma corruption inserted into dam disaster conversations

Key Insight: Simple bot detection (posting frequency, profile photos) won't catch this. These are real accounts following coordinated instructions.

Computational propaganda requires network analysis, temporal pattern detection, and cross-account coordination tracking.
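
One such coordination signal is bulk account registration. Here's a toy sketch of how you might flag registration bursts like the 1,390 invasion-week accounts above; the data, dates, and column names are all hypothetical.

```python
import pandas as pd

# Hypothetical account metadata; column names are illustrative.
accounts = pd.DataFrame({
    "account": ["a1", "a2", "a3", "a4", "a5"],
    "created": pd.to_datetime([
        "2022-02-24", "2022-02-25", "2022-02-26",  # burst week
        "2019-06-01", "2021-11-12",                # organic background
    ]),
})

# Count registrations per ISO week; a week holding an outsized share of
# all registrations is a classic coordination fingerprint.
per_week = accounts.groupby(accounts["created"].dt.to_period("W")).size()
print(per_week[per_week >= 3])  # toy threshold for this tiny sample
```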

🎮 Game 6: State Actor Detection

REAL CASE STUDY
AgentAcademy analyzed Chinese digital diplomacy on TikTok. You're reviewing the findings.

TikTok Analysis: 1,994 videos + 48,070 comments

Finding 1: China-general content gets 60x more plays than Xinjiang content (5.3B vs 87M plays)
Finding 2: Only 3.5% of comments are in Chinese—80.9% Latin/English
Finding 3: State media accounts get 28-75% higher engagement than organic creators
Finding 4: 10% duplicate comments, top emoji (🥰🥰🥰) repeated 300x

What does this pattern reveal about the campaign's purpose?

📚 Dual-Sided Information Warfare

Myth: Only one side uses bots and coordination.
Reality: Both sides often run competing propaganda campaigns.

Case Study: Xinjiang Cotton Controversy (AgentAcademy Study)

When H&M faced boycotts over Xinjiang labor concerns (March 2021), three AI models analyzed 92,038 tweets. They found both sides coordinating:

🇨🇳 Pro-China Side:
• Central amplifier: @SpokespersonCHN (China's Foreign Ministry)
• State media push: CGTN, Global Times, Xinhua
• March 25-26 spike: 36% of ALL tweets in 2 days

🇺🇸 Pro-Uyghur Side:
• Central amplifiers: @MarcRubio, @nathanlawkc
• Human rights organizations: @hrw, @amnesty
• 2x higher engagement despite lower volume (308 vs 144 avg retweets)

The Battle:
• 88% of tweets were retweets (not original discourse)
• This wasn't a conversation—it was a volume war
• 10,027 suspicious accounts flagged

Lesson: Detection frameworks that assume single-actor coordination miss half the picture. Always check for adversarial amplification.

AgentAcademy Improvement Added:
"Dual-sided coordination framework—check for adversarial amplification on both sides of contentious issues"

📚 Advanced Detection Techniques from AgentAcademy

After analyzing 7 computational propaganda campaigns, here are the detection patterns that work:

1. Language Anomaly Detection

Case: Belarus Protests (#StandWithBelarus, Sept 2020)

Problem: 38% of tweets were in Thai. Bot attack?
Investigation: All three AI models independently reached the same conclusion: this was the Milk Tea Alliance (Thai democracy activists showing solidarity), not a bot attack.
Evidence: Zero Thai accounts exceeded 50 tweets/day, and 22,405 unique accounts all retweeted @netiwitc

Detection Rule: Language >20% non-local → investigate (could be bots OR cross-movement solidarity)
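
A minimal sketch of this language check, assuming per-tweet language labels from an external detector (e.g., langdetect or fastText); the counts and "local" language below are invented for illustration.

```python
from collections import Counter

# Hypothetical per-tweet language labels; in practice these come from a
# language detector, not hand-coding.
tweet_langs = ["en"] * 50 + ["th"] * 38 + ["ru"] * 12
LOCAL_LANG = "en"  # assumed dominant language of the conversation

counts = Counter(tweet_langs)
total = sum(counts.values())
for lang, n in counts.most_common():
    share = n / total
    note = "  <-- investigate (bots OR cross-movement solidarity?)" \
        if lang != LOCAL_LANG and share > 0.20 else ""
    print(f"{lang}: {share:.0%}{note}")
```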

2. Temporal Spike Invalidates Averages

When single day contains >30% of total volume, "average engagement" is meaningless.

Examples from AgentAcademy:
• Kashmir: 33,000 tweets on one day, then 85% drop (flash mob, not organic)
• Xinjiang: 36% of tweets in 2 days during H&M boycott announcement
• Belarus Thai wave: 89% of Thai tweets on single day (Sept 20)

Detection Rule: Peak-to-trough ratio >4:1 → identify triggering event before interpreting
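
Here's one way to apply this rule to a daily volume series; the numbers are invented to mimic a Kashmir-style flash mob, not real data.

```python
import pandas as pd

# Hypothetical daily tweet volumes around a flash-mob style spike.
daily = pd.Series(
    [900, 1100, 33_000, 4_800, 1_000, 950, 870],
    index=pd.date_range("2019-08-03", periods=7),
)

peak_to_trough = daily.max() / daily.min()
peak_share = daily.max() / daily.sum()

if peak_to_trough > 4 or peak_share > 0.30:
    print(f"Spike on {daily.idxmax().date()}: "
          f"peak/trough = {peak_to_trough:.0f}:1, "
          f"{peak_share:.0%} of all volume in one day.")
    print("Averages are unreliable here; identify the triggering event first.")
```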

3. High Retweet Ratios = Amplification Battle

When >80% of tweets are retweets, you're studying amplification, not discourse.

What to do:
• Xinjiang case: 88% retweets → switched to network analysis (who amplifies whom?)
• Kakhovka Dam: 72% retweets → tracked retweet cascades from state accounts

Detection Rule: RT ratio >80% → use network analysis instead of engagement analysis
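
A minimal sketch of that switch to network analysis: build a directed retweet graph and rank authors by how many distinct amplifiers they attract. The edges and account names below are hypothetical.

```python
import networkx as nx

# Hypothetical retweet edges: (retweeter, original_author).
retweets = [
    ("u1", "StateMediaA"), ("u2", "StateMediaA"), ("u3", "StateMediaA"),
    ("u4", "StateMediaA"), ("u5", "ActivistB"),   ("u6", "ActivistB"),
]

G = nx.DiGraph()
G.add_edges_from(retweets)

# In-degree = number of distinct accounts amplifying each author.
# High in-degree hubs tell you WHO is driving the volume war.
hubs = sorted(G.in_degree(), key=lambda pair: pair[1], reverse=True)
print([(node, n) for node, n in hubs if n > 0])
```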

4. State Media Engagement Premium

Case: Chinese TikTok Accounts

State-affiliated accounts got 28-75% higher engagement than organic creators with similar follower counts.

Explanation: Algorithmic boost OR coordinated support OR genuine interest?

Detection Rule: Compare engagement rates by account type after controlling for follower count (use log-transform!)
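
A sketch of that comparison, assuming per-account engagement rates (likes per follower). Because such rates are heavy-tailed, the sketch compares geometric means via a log-transform; the numbers are invented.

```python
import math
from statistics import mean

# Hypothetical engagement rates (likes per follower) for two account types.
state_media = [0.08, 0.12, 0.30, 0.05]
organic     = [0.04, 0.06, 0.09, 0.03]

def log_mean(rates):
    """Mean of log-rates; exp() of it gives the geometric mean."""
    return mean(math.log(r) for r in rates)

# Ratio of geometric means: robust to the heavy tail that would let
# one viral post dominate a raw average.
premium = math.exp(log_mean(state_media) - log_mean(organic))
print(f"State-media engagement premium: {premium:.2f}x")
```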

Pro Tip: Multi-model validation matters. When Claude, GLM-5, and Kimi all independently find the same pattern, it's robust. That's why VineAcademy's Collaborative Playground uses this approach.

🧠 CommDAAF Critical Thinking Checkpoint

Before you continue, reflect deeply on what you've learned. Write thoughtful responses (minimum 20 characters each).

1. What makes bot detection an "arms race" between detectors and bot creators? How do bots evolve to evade detection?


2. How can legitimate automation (news aggregators, support bots, accessibility tools) be distinguished from manipulation bots? What criteria matter?


3. Should platforms ban ALL bots, or are some beneficial? Defend your position with specific examples and an ethical framework.


Unit 2 Complete!

You've mastered Social Network Analysis & Computational Propaganda! You can now calculate centrality, detect communities, predict cascades, identify bots, and recognize state actor information warfare. You understand dual-sided campaigns, language anomalies, temporal patterns, and the techniques used in real-world computational propaganda. You're equipped to navigate—and analyze—the networked world with a critical eye.
