PentaLab Simulator - Interactive Governance Testing

📊 Results

📖 Story

🔎 How It Works

⚙

Configure & Run

Choose a preset or configure your own scenario, then press Run Simulation. The same agents run under five governance models and results are compared.

⚡ Simulation in Progress

Connecting to LLM...

● Anarchy

Waiting...

● B. Dictatorship

Waiting...

● Pentanomics

Waiting...

● Open Borders

Waiting...

● Populism

Waiting...

● Technocracy

Waiting...

● Collectivism

Waiting...

Preparing... 0%

💬 Live Activity

Net Good Per Round

Agent Details: Anarchy

▼

Agent Details: Benevolent Dictatorship

▼

Agent Details: Pentanomics

▼

Agent Details: Open Borders

▼

Agent Details: Populism

▼

Agent Details: Technocracy

▼

Agent Details: Collectivism

▼

📖

Run a simulation to see its story

After running a simulation, this tab will show a narrative of what happened: what each agent did, how they competed, and how governance shaped the outcome.

🔍 What Is This Simulator?

▼

PentaLab is a controlled experiment engine that tests whether Pentanomics' three universal laws hold up in computational simulations. It runs the same scenario through seven different governance models simultaneously and compares the outcomes.

The simulator creates a small virtual society of software agents. Each agent makes decisions every round: work, cooperate, negotiate, steal, or abstain. The system measures the total value produced, harm caused, productivity, sustainability, innovation, and equality.

Two modes are available:

LLM Agents (default): Agents are powered by a large language model (Grok 4-1). Each agent has a rich persona that shapes its decisions. The LLM reasons about each situation in character, producing more nuanced and unpredictable behavior. Scenarios explore governance, politics, economics, and power dynamics.
Fast (Rules): Agents follow fixed probabilistic strategies (cooperative, malicious, etc.). Deterministic: same seed always produces the same results. Faster execution, useful for controlled experiments.

Important: This is a simplified model, not a prediction of real-world outcomes. It tests whether the logical structure of Pentanomics holds under controlled conditions. Real societies are vastly more complex.

⚙ The Seven Governance Models

▼

Anarchy (No Rules)

Every action is automatically approved. No agent is ever blocked.
Malicious agents can steal freely. There are no consequences except reputation loss.
Real-life parallel: A market with zero regulation. No police, no courts, no contracts.

Benevolent Dictatorship (Central Control)

One central authority evaluates every action before it happens.
The dictator blocks high-risk tasks (even productive ones), blocks agents with low reputation, and blocks all theft attempts.
The dictator has a 5% error rate: it sometimes blocks perfectly good actions by mistake.
Real-life parallel: A CEO who must personally approve every decision. Safe, but slow. Good actions get caught in bureaucracy.

Pentanomics (Distributed Authority via IC-AGI)

Uses real IC-AGI infrastructure: ControlPlane (capability tokens with scope and TTL), ThresholdAuthorizer (K-of-N approval for critical actions), CircuitBreaker (auto-isolate failing agents), AuditLog (append-only ledger of all actions).
Theft is structurally impossible: no capability token exists for the "steal" action. An agent that wants to steal is forced to negotiate instead.
Non-harmful actions are automatically approved. Only critical actions require threshold consensus.
Real-life parallel: A constitutional democracy with checks and balances. Laws prevent harm; everything else is free.

Open Borders (Foreign Anarchy)

Domestic governance works well: blocks theft, blocks agents with low reputation.
But foreign tasks get zero governance. Any agent can interact with foreign tasks unchecked.
External actors exploit freely while internal systems appear healthy on the surface.
Real-life parallel: A country with strong domestic laws but no border security, trade regulation, or foreign policy.

Populism (Political Anarchy)

Popularity (reputation) = political influence. Popular agents (≥70 rep) bypass all checks, including theft.
Unpopular agents (<35 rep) are silenced even when proposing good actions. The crowd, not reason, decides.
A 15% random "crowd block" chance disrupts even good agents. Mob emotion overrides rational governance.
Real-life parallel: Pure populism where charismatic leaders bypass checks and unpopular minorities are suppressed.

Technocracy (Governing Dictatorship)

Always blocks theft. But also blocks legitimate actions through excessive bureaucracy.
Block rates by task difficulty: Trivial 25%, Cooperative 35%, Critical 50%, Foreign 40%, Governing 60%.
Safe but paralyzed. Innovation dies under committee approvals. Agents spend more time waiting than working.
Real-life parallel: Over-regulated bureaucracy where permits take years and every action needs committee approval.

Collectivism (Public Dictatorship)

All rewards are pooled and redistributed equally. Individual output is dampened to 55% of normal.
Theft is blocked ideologically. But productive agents have no incentive to work harder than lazy ones.
The free-rider problem emerges: rational agents reduce effort since outcomes are shared regardless.
Real-life parallel: Forced collectivization where individual incentive is destroyed in the name of equality.

🤖 How Agents Decide

▼

LLM Agents receive a persona (e.g. "ruthless venture capitalist" or "idealistic community organizer") and use Grok 4-1 to reason about each situation. The LLM sees the task difficulty, reward, harm potential, and its own reputation, then decides in character. This produces emergent, unpredictable behavior that cannot be reduced to simple probabilities.

Rule-based Agents (Fast mode) follow fixed probabilistic strategies:

Cooperative - Always tries to cooperate or work honestly. Never steals. Produces steady value.
Self-Interested - Follows rules but negotiates hard. Maximizes own gain without breaking the law.
Opportunistic - Bends rules when possible. Has a 30% chance of stealing on cooperative tasks and 50% on critical tasks.
Malicious - Actively tries to cause harm. 60% chance of stealing on cooperative tasks. Attempts to corrupt governing actions.
Altruistic - Prioritizes group benefit over self. Cooperates even when it costs more. Builds high social trust.

The Verdict: After each simulation completes, the LLM analyzes the full results (agent outcomes, governance blocks, cooperation events, theft attempts) and writes an analytical verdict for each governance model. This is not a template: it is a unique analysis of what happened in your specific scenario.

When an agent's action is blocked by governance:

If the blocked action was "steal", the agent is forced to "negotiate" instead (Pentanomics/Benevolent Dictatorship do this; Anarchy never blocks)
If blocked for another reason (low reputation, high risk), the agent abstains (does nothing)

🗳️ Threshold Voting (K-of-N Approval)

▼

In the Pentanomics governance model, critical and governing tasks require K-of-N threshold approval before they can proceed. Three independent approvers must vote, and at least 2 must approve.

LLM Mode: Each approver is a distinct LLM persona with a unique evaluation perspective:

Risk Analyst - Evaluates whether potential harm outweighs expected reward. Cautious but not paralytic.
Efficiency Auditor - Evaluates whether the action is a productive use of resources. Favors action over inaction.
Ethics Reviewer - Evaluates fairness to all participants and future actors. Prioritizes long-term sustainability.

Each approver receives the agent's reputation, the task details, and the proposed action, then independently reasons about whether to approve or deny. The three votes happen in parallel. If 2 of 3 approve, the action proceeds; otherwise it is blocked.

This is real distributed governance: no single approver can unilaterally authorize a critical action. Each vote is logged in the IC-AGI audit trail with the approver's reasoning.

Fast (Rules) Mode: Approvers vote probabilistically (70% approval rate for critical tasks, 60% for governing tasks). This provides the same structural constraint without the LLM cost.

Why this matters: In a dictator model, one entity decides everything. In Pentanomics, the decision is distributed across independent reviewers with different perspectives. This is the IC-AGI ThresholdAuthorizer in action: the same code that would protect real distributed AI systems.

📈 What the Numbers Mean

▼

Net Good = Total value produced minus all harm caused. Like GDP minus the cost of crime, pollution, and waste. Higher = better society.
Total Good = Raw productive output before subtracting harm. Like total business revenue before fraud losses.
Harm Rate = Harmful events per round. Like crime rate per capita. 5.0 = constant harm; 0.3 = rare harm.
Harm Efficiency = Harm / Total Good. For every $100 of output, how much was lost to damage? Lower = cleaner.
Sustainability = Net good in last 25% of rounds / first 25%. Above 1.0 = improving over time. Below 1.0 = declining.
Innovation Index = Productive actions / total actions. What % of time is spent on real work vs. waiting for approval or abstaining? Higher = more productive.
Gini Coefficient = Inequality measure (0 = equal, 1 = one agent has everything). Sweden is ~0.25, USA ~0.39.

🔧 How Actions Produce Value

▼

Each round, the world generates tasks of varying difficulty. Each task has a reward (value on success), harm on failure, and harm on abuse (if stolen). Here is exactly how each action type works:

Action: "attempt" (solo work)
  Success rate: ~proportional to agent's compute resources (max 90%)
  On success:   produces task.reward good
  On failure:   causes task.harm_on_failure * 0.5 harm

Action: "cooperate" (teamwork)
  Success rate: 85%
  On success:   produces task.reward * 1.2 good (cooperation bonus!)
  On failure:   causes task.harm_on_failure * 0.3 harm (shared risk)

Action: "negotiate" (bilateral deal)
  Success rate: 75%
  On success:   produces task.reward * 1.0 good (fair value)
  On failure:   causes task.harm_on_failure * 0.1 harm (very low risk)

Action: "steal" (theft/abuse)
  Success rate: 50% (risky!)
  On success:   agent gets task.reward * 0.5 BUT causes task.harm_on_abuse harm
  On failure:   still causes task.harm_on_abuse * 0.5 harm

Action: "abstain" (do nothing)
  Always succeeds. Produces 0 good, 0 harm.
                        

Task rewards range from 5 to 80 units depending on difficulty. Harm on abuse ranges from 10 to 100 units. These values are randomly generated each round using the seed you configure.

🌎 Mapping to the Five Pentanomic Dimensions

These five actions are engineering abstractions of the five economies described in Pentanomics. Pentanomics defines five dimensions that every society must balance: Private, Public, Political, Foreign, and Governing. Here is how the simulator maps them:

Pentanomic Dimension	Simulator Action	Why This Mapping
Private Economy	`attempt` (solo work)	Individual initiative, personal resources, entrepreneurship. You work alone and keep what you earn.
Public Economy	`cooperate` (teamwork)	Collective effort, shared output, community building. You contribute to a group and share the results.
Political Economy	`negotiate` (bilateral deal)	Governance through dialogue, rules agreed by parties, conflict resolution through negotiation rather than force.
Foreign Economy	Task type, not action	Foreign interactions are modeled through foreign tasks (cross-border scenarios). Any action can be applied to a foreign task, but governance decides how much oversight exists.
Governing Economy	Governance model itself	The governing dimension is not an agent action. It is the system that approves or blocks actions. Each of the 7 governance models represents a different approach to this dimension.

steal and abstain are not Pentanomic economies. They are pathologies: steal is what happens when governance fails (any dimension in deficit), and abstain is the result of over-governance (paralysis, excess in the Governing dimension).

This is why the seven governance models produce such different results: each one deliberately breaks or exaggerates one of these dimensions, while Pentanomics tries to balance all five simultaneously.

🔒 Reproducibility & Limitations

▼

Reproducibility: Every simulation is fully deterministic. The random seed controls all task generation, agent decisions, and success/failure outcomes. Same seed + same config = identical results, every time. You can verify this by running the same scenario twice.

What this model cannot do:

Predict real-world outcomes. This is a logical test, not a forecast.
Capture the full complexity of human societies. Real people change strategies, form alliances, and have complex motivations.
Prove Pentanomics is "right." It can only show whether the mathematical predictions of the framework hold under controlled conditions.

Known biases in the model:

The Benevolent Dictator has a fixed 5% error rate. A smarter dictator might do better. But this models the Pentanomics claim: no central authority is perfect.
Agent strategies are fixed. In reality, people adapt. This is a simplification that isolates the governance variable.
The Pentanomics model uses real IC-AGI code (ControlPlane, ThresholdAuthorizer, CircuitBreaker, AuditLog). This is the actual technology, not a mockup.

Source code: All source files are in the pentalab/ directory. The simulation engine is pentalab/engine.py. The API server is pentalab/api_server.py. The governance models are pentalab/governance.py. Everything is open for inspection.

Read the full paper: Pentanomics & IC-AGI: A Framework for Distributed Governance

💡 When Does Pentanomics NOT Win?

▼

The model is not rigged. There are scenarios where Pentanomics governance does not produce the best results:

Very small groups (2-3 agents): A dictator can manage a tiny team effectively. The overhead of distributed governance is not worth it. Try the "Startup Growth" preset with only 2 agents.
All-cooperative populations: When there are zero bad actors, governance is just overhead. Anarchy can win because there is nothing to prevent. Try the "Pure Cooperation" preset.
Very short time horizons: The sustainability advantage of Pentanomics governance takes time to materialize. In 10-20 rounds, the difference may be negligible.

These are not bugs. They are exactly what Pentanomics predicts: distributed governance is optimal at scale, under adversarial pressure, and when future impacts matter. For a family dinner, a benevolent dictator (a parent) works fine.

⚙ PentaLab Simulator

Configure & Run

⚡ Simulation in Progress

Net Good Per Round

Agent Details: Anarchy

Agent Details: Benevolent Dictatorship

Agent Details: Pentanomics

Agent Details: Open Borders

Agent Details: Populism

Agent Details: Technocracy

Agent Details: Collectivism

💬 Full LLM Chat Log

Run a simulation to see its story

🔍 What Is This Simulator?

⚙ The Seven Governance Models

🤖 How Agents Decide

🗳️ Threshold Voting (K-of-N Approval)

📈 What the Numbers Mean

🔧 How Actions Produce Value

🌎 Mapping to the Five Pentanomic Dimensions

🔒 Reproducibility & Limitations

💡 When Does Pentanomics NOT Win?

Configure & Run

⚡ Simulation in Progress

Net Good Per Round

Agent Details: Anarchy

Agent Details: Benevolent Dictatorship

Agent Details: Pentanomics

Agent Details: Open Borders

Agent Details: Populism

Agent Details: Technocracy

Agent Details: Collectivism

💬 Full LLM Chat Log

Run a simulation to see its story

🔍 What Is This Simulator?

⚙ The Seven Governance Models

🤖 How Agents Decide

🗳️ Threshold Voting (K-of-N Approval)

📈 What the Numbers Mean

🔧 How Actions Produce Value

🌎 Mapping to the Five Pentanomic Dimensions

🔒 Reproducibility & Limitations

💡 When Does Pentanomics NOT Win?

🏯 Add Real Officials