Understanding autonomous AI agents: Moving beyond autocomplete to systems with reasoning, tool-calling capabilities, and the power to reshape research
The term "vibe coding" was coined by Andrej Karpathy (OpenAI co-founder, former Tesla AI leader) in February 2025. Unlike traditional coding, which requires syntax mastery, vibe coding relies on conversational prompts to generate code, set up project structures, debug errors, and even orchestrate multi-step research workflows.
As of February 2026, 92% of US developers use AI coding tools daily, and 41% of all code written globally is now AI-generated. This isn't futuristic speculation—it's happening right now.
25% of Y Combinator 2025 startups built the majority of their codebase with AI assistance. Development cycles accelerate by up to 55% compared to manual coding.
Bank of America uses conversational coding agents to rapidly prototype fraud detection algorithms, cutting delivery times by 70%. Shopify utilizes AI to automate store template creation, reducing routine coding workloads by over a third.
Modern AI coding models are far more than "autocomplete on steroids." They're autonomous agentic systems with reasoning capabilities, tool-calling functions, and contextual judgment.
Think of them less as "smart autocomplete" and more as junior research assistants with photographic memory of millions of codebases—capable of independent work, but requiring oversight.
AI agents are systems that can operate autonomously to achieve goals, making decisions and taking actions without constant human direction. Unlike traditional software that follows rigid instructions, agents exercise contextual judgment.
Tool (like GitHub Copilot): Suggests next line of code when you type. You're in control.
Agent (like Cursor AI, Replit Agent): You say "build a task manager with drag-and-drop," and it generates a working prototype, sets up the development environment, finds libraries, and handles deployment. The agent orchestrates the entire workflow.
90% of software professionals now use AI agents (not just "copilots") daily. In 2021, building a Minimum Viable Product took three months and $50k. In 2026, an orchestrator can build, test, and deploy a functional SaaS over a weekend for the cost of an API subscription.
This is the power—and responsibility—of agentic social science.
| Model | SWE-bench Score | Best For |
|---|---|---|
| Claude Opus 4.6 | 80.8% | Code review, debugging, cybersecurity detection |
| Claude Sonnet 4.6 | 79.6% | 98% of Opus quality at 60% cost, best value |
| DeepSeek V3.1 | 66% | Algorithmic tasks, 10-100x cheaper per token |
| Gemini 2.5 Pro | 63.8% | WebDev leader, 1M context window, multimodal |
| Kimi K2.5 | — | Native video processing, vision-text joint training |
| GLM-4.6/4.7 | — | Open-source value leader, MIT license, $0.35/1M tokens |
Data as of February 2026. SWE-bench Verified measures ability to solve real-world GitHub issues.
Studies show project completion times can improve by up to 55% with AI assistance. Here's what vibe coding excels at in 2026:
According to Index.dev's 2026 survey, developers report cutting routine coding tasks by 30-40% and accelerating exploration phases by 50%+ when using AI agents.
Andrew Hall's Political Science Replication (2025): An AI agent replicated and extended a published political science paper in under an hour for approximately $10—work that took a trained researcher several days to verify. Hall envisions "100x research institutions" where small teams of expert researchers direct swarms of AI agents handling data collection, analysis, robustness checks, and literature synthesis in parallel.
Startup Speed Revolution: As recently as late 2025, AI coding assistants were "useful but halting and clumsy" (New York Times). By early 2026, tools like Cursor, Replit Agent, Lovable, and Bolt.new transformed the landscape. Users describe their app idea—"build a task manager with drag-and-drop"—and the system generates a working prototype running in a browser.
Enterprise Adoption: Shopify's AI automation reduced routine development tasks by over a third. Bank of America's fraud detection prototyping now takes 70% less time than manual approaches.
Sources: The New Stack, DEV Community
AI agents are powerful, but they fail in specific, predictable ways. Knowing when NOT to trust AI is as important as knowing when to use it.
AI Intuition is the instinct that something is off in an AI agent's output, even when the code runs without errors.
Like an experienced chef who can tell by smell when something isn't quite right in the kitchen, skilled vibe coders develop a sixth sense for results that don't quite add up.
This intuition comes from practice, validation, and learning to ask: "Does this FEEL right?"
By 2026, AI agents are excellent at syntax, debugging, and standard patterns. But they still fail at:
Challenge 1: Causal vs. Predictive Modeling
Student Request: "Does social media usage cause depression?"
AI Generated: Regression model with high R² showing correlation
Problem: AI doesn't distinguish causal inference from prediction. Strong correlation doesn't establish causation: it could reflect reverse causation (depression → more social media) or confounding (loneliness causing both).
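To see why, here is a minimal simulation (all data synthetic, variable names invented for illustration): a "loneliness" confounder drives both social media use and depression, so the two correlate strongly even though social media has zero causal effect by construction.

```python
import random

random.seed(0)
n = 5000

# Synthetic data: "loneliness" (the confounder) drives BOTH variables;
# social media use has ZERO direct effect on depression by construction.
social_media, depression = [], []
for _ in range(n):
    loneliness = random.gauss(0, 1)
    social_media.append(2.0 * loneliness + random.gauss(0, 1))
    depression.append(1.5 * loneliness + random.gauss(0, 1))

def corr(xs, ys):
    """Pearson correlation, computed from scratch."""
    mx, my = sum(xs) / len(xs), sum(ys) / len(ys)
    cov = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    vx = sum((x - mx) ** 2 for x in xs)
    vy = sum((y - my) ** 2 for y in ys)
    return cov / (vx * vy) ** 0.5

r = corr(social_media, depression)
print(f"correlation = {r:.2f}")  # strong and positive, yet causally meaningless here
```

A regression on this data would show an impressive fit, which is exactly the output an agent hands back when asked the causal question.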
Challenge 2: Theoretical Construct Validity
Student Request: "Measure political polarization on Twitter"
AI Generated: Sentiment distance between left/right-leaning accounts
Problem: AI chose ONE operationalization when "polarization" could mean: (1) sentiment extremity, (2) network segregation, (3) discourse toxicity, or (4) belief distribution spread. Each measures something different.
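The point can be made concrete with a toy sketch (all sentiment scores hypothetical): two of these operationalizations, group-mean distance and average extremity, can rank the same corpora in opposite orders.

```python
def mean(xs):
    return sum(xs) / len(xs)

def group_distance(left, right):
    """Operationalization 1: polarization as distance between group means."""
    return abs(mean(left) - mean(right))

def extremity(left, right):
    """Operationalization 2: polarization as average sentiment extremity."""
    return mean([abs(s) for s in left + right])

# Corpus A: two moderate but cleanly separated camps
a_left, a_right = [-0.5] * 5, [0.5] * 5
# Corpus B: extreme voices on both sides whose group means cancel out
b_left, b_right = [-0.9, 0.9, -0.9, 0.9], [0.9, -0.9, 0.9, -0.9]

print(group_distance(a_left, a_right), extremity(a_left, a_right))  # 1.0 and 0.5
print(group_distance(b_left, b_right), extremity(b_left, b_right))  # 0.0 and ~0.9
```

Measure 1 calls corpus A the polarized one; measure 2 calls corpus B the polarized one. Whichever the AI picks silently determines your finding.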
Lesson: In 2026, AI agents execute analyses brilliantly—but they can't tell you if you're analyzing the RIGHT thing. Research design wisdom still requires humans.
AI agents don't just inherit biases from training data—they reshape how we think, decide, and delegate intellectual labor. The risks aren't just technical errors; they're about power, accountability, and who benefits from automated knowledge production.
This section draws on emerging scholarship about agentic social science: the deployment of AI agents for automated computational research, operating with guardrails, accountability structures, and human oversight.
Key Insight: AI agents make consequential methodological decisions through default settings—similarity thresholds, time windows, classification schemes—often without human awareness.
When you ask an AI agent to "analyze sentiment over time," it silently chooses defaults for you: a classification scheme, a time window, a neutrality threshold.
Problem: We delegate decision-making to AI defaults that may not be optimized for OUR research context. Some defaults are convenient; others are subtly misaligned with our goals.
Question to Ask: "What decisions did the AI make FOR me, and should I have made them differently?"
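As a hypothetical illustration of how one silent default changes a finding, the sketch below varies only a neutrality threshold (the `neutral_band` parameter is invented for this example) and the share of "positive" posts swings dramatically.

```python
# Hypothetical sentiment scores for one batch of posts
scores = [0.02, 0.04, -0.3, 0.6, 0.01, -0.05, 0.03, 0.5, -0.6, 0.02]

def share_positive(scores, neutral_band=0.0):
    """Share of posts classified 'positive' given a neutrality threshold.
    `neutral_band` is exactly the kind of default an agent picks silently."""
    return len([s for s in scores if s > neutral_band]) / len(scores)

print(share_positive(scores))        # 0.7 -> "mostly positive discourse"
print(share_positive(scores, 0.05))  # 0.2 -> "mostly neutral discourse"
```

Same data, same code, opposite headline, purely because of a threshold nobody consciously chose.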
Key Insight: AI models are trained to identify the most regular patterns—but outliers, edge cases, and rare events are often where the most important insights live.
Example: An AI agent analyzing social movements might classify protest tactics by frequency: peaceful marches (90%), boycotts (7%), disruptive actions (3%). The agent focuses analysis on marches because they're "most representative."
Problem: Disruptive actions might be rare but theoretically crucial—they're the tactics that drive policy change! By optimizing for "regular patterns," AI can systematically exclude what's most sociologically interesting.
Question to Ask: "What's being left out of this analysis? Are the outliers noise—or signal?"
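A minimal sketch of this failure mode (event counts hypothetical): a common "keep only categories above 5% frequency" default silently drops the rare tactic entirely.

```python
from collections import Counter

# Hypothetical protest-event log: one tactic label per event
events = ["march"] * 90 + ["boycott"] * 7 + ["disruption"] * 3
counts = Counter(events)

# A typical agent default: keep only categories covering >= 5% of events
kept = {t: c for t, c in counts.items() if c / len(events) >= 0.05}

print(sorted(kept))  # ['boycott', 'march'] -- 'disruption' silently vanishes
```

The theoretically crucial 3% never reaches the analysis, and nothing in the output flags its absence.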
Drawing on Karen Hao's "AI Empire": AI models and companies operate like empires, contesting land (data centers), water resources (cooling systems), manpower (human annotators), and intellectual capital.
Hidden Costs: land for data centers, water for cooling systems, the labor of human annotators, and intellectual capital.
Question to Ask: "Who profits from this AI system? Whose labor and resources made it possible?"
Critical Question: When an AI agent automates analysis that leads to flawed conclusions, who is accountable?
Human-in-the-Loop is Critical: AI agents should augment, not replace, human judgment. Humans design the questions, verify the outputs, and own the conclusions.
The Hard Truth: If you can't explain how your AI agent reached a conclusion, you're not doing research—you're outsourcing intellectual responsibility.
To use AI responsibly, you need to understand the infrastructure behind it:
We run on Ollama Cloud with privacy-first models. Your data and prompts are never used to train models. We use a multi-model framework (GLM-5, Kimi K2.5, Minimax M2.5) to ensure diverse perspectives and reduce single-model bias.
CommDAAF (Communication Data Analysis and Automation Framework) is VineAcademy's answer to the accountability problem. It's a quality control system for agentic computational research.
Just as laboratory protocols ensure reproducibility in experimental science, CommDAAF enforces quality standards on every agentic analysis.
VineAcademy uses multiple AI models (not just one) to generate analyses. Each model brings different "temperaments"—one might be conservative, another creative. When models disagree, we force debate and synthesis. This epistemic diversity reduces systematic errors.
Operationalize your variables explicitly.
Why: "Engagement" could mean likes, comments, shares, retweets, or any combination. AI will pick one unless you specify.
Example: "I'm measuring total engagement as (likes + comments + shares) per post, averaged over the last 30 days."
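A sketch of that operationalization in code (post records and dates are made up; the function name is ours):

```python
from datetime import date, timedelta

# Hypothetical post records: (posted_on, likes, comments, shares)
posts = [
    (date(2026, 2, 10), 120, 14, 9),
    (date(2026, 2, 20), 80, 6, 3),
    (date(2025, 12, 1), 500, 40, 60),  # older than 30 days: excluded
]

def mean_engagement(posts, today=date(2026, 3, 1), window_days=30):
    """Total engagement = likes + comments + shares per post,
    averaged over posts from the last `window_days` days."""
    cutoff = today - timedelta(days=window_days)
    recent = [likes + comments + shares
              for posted_on, likes, comments, shares in posts
              if posted_on >= cutoff]
    return sum(recent) / len(recent) if recent else 0.0

print(mean_engagement(posts))  # 116.0
```

Writing the definition down this precisely is the point: the AI can now implement your measure rather than invent its own.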
Validate AI output by hand.
Why: AI can confidently give you wrong answers. You MUST verify.
Example: "I'll manually code 20 random posts and compare my engagement counts to the AI's calculations."
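A minimal sketch of such a spot check (all counts hypothetical): compare your 20 hand-coded posts to the AI's numbers and quantify the disagreement.

```python
# Hypothetical spot check: hand counts for 20 random posts vs. the AI's numbers
manual = [12, 40, 7, 33, 5, 18, 22, 9, 61, 14, 3, 27, 8, 45, 19, 30, 11, 6, 52, 24]
ai     = [12, 40, 7, 35, 5, 18, 22, 9, 61, 14, 3, 27, 8, 44, 19, 30, 11, 6, 52, 24]

matches = sum(m == a for m, a in zip(manual, ai))
agreement = matches / len(manual)
mean_abs_error = sum(abs(m - a) for m, a in zip(manual, ai)) / len(manual)

print(f"exact agreement: {agreement:.0%}")          # two of twenty posts disagree
print(f"mean absolute error: {mean_abs_error:.2f}")
```

If agreement is low, the disagreements tell you exactly which posts to inspect before trusting the pipeline at scale.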
Rule out alternative explanations.
Why: Correlation ≠ causation. Always consider competing explanations for the patterns you find.
Example: "High engagement could be due to: (1) controversial content, (2) bot amplification, (3) platform algorithm changes, or (4) external events driving attention."
Vibe coding gives you superpowers—you can analyze millions of posts, orchestrate complex research pipelines, generate insights in hours that once took weeks.
But power without wisdom, accountability, and ethical grounding is dangerous.
Questions to Consider:
You now understand:
✓ How vibe coding and AI agents work
✓ Agentic systems vs. autocomplete tools
✓ Top models as of February 2026
✓ AI Intuition and limitations
✓ Power, ethics, and accountability
✓ CommDAAF multi-model framework
Ready to use AI responsibly in social science research!