This post outlines the next major evolution for Simplex: Neural IR (Intermediate Representation) and Neural Gates. I'll describe what these concepts mean, assess honestly where the language stands today, and explain the work required to get there.
What Is Neural IR?
A Neural Intermediate Representation treats program execution as a differentiable computation graph. Where a traditional pipeline (source → AST → LLVM IR → machine code) lowers a program into discrete operations, a Neural IR keeps every operation mathematically differentiable, enabling gradient-based optimization of program behavior.
Key characteristics of Neural IR:
- Differentiable execution: Operations can have gradients computed through them
- Soft logic: Conditionals return continuous values (0.0-1.0) rather than binary true/false
- Learnable parameters: Thresholds, weights, and routing decisions can be optimized from data
- End-to-end training: The entire program can be improved through backpropagation
What Are Neural Gates?
Neural Gates replace traditional binary logic gates with soft, learnable alternatives:
| Traditional Gate | Neural Gate Equivalent |
|---|---|
| `if x > 0.5 then A else B` | Returns a weighted mix: `sigmoid(x) * A + (1 - sigmoid(x)) * B` |
| `AND(a, b)` → 0 or 1 | `min(a, b)` or `a * b` → continuous |
| `OR(a, b)` → 0 or 1 | `max(a, b)` or `a + b - a*b` → continuous |
The critical feature: the threshold at which a Neural Gate "triggers" is not hardcoded. It can be learned from examples, allowing the program to optimize its own decision boundaries.
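To make the learnable-threshold idea concrete, here is a minimal sketch in plain Python (not Simplex syntax): a sigmoid gate whose cutoff is adjusted by gradient descent on a handful of labeled examples. The steepness `k`, the toy data, and the training loop are illustrative assumptions, not part of the language design.

```python
import math

def soft_gate(x, threshold, k=10.0):
    """Differentiable version of `x > threshold`: returns a value in (0, 1)."""
    return 1.0 / (1.0 + math.exp(-k * (x - threshold)))

# Toy data: inputs above ~0.6 should activate the gate.
data = [(0.2, 0.0), (0.4, 0.0), (0.55, 0.0), (0.7, 1.0), (0.8, 1.0), (0.95, 1.0)]

threshold, k, lr = 0.3, 10.0, 0.05             # start from a wrong threshold
for _ in range(500):
    grad = 0.0
    for x, y in data:
        g = soft_gate(x, threshold, k)
        # d(MSE)/d(threshold), using the sigmoid derivative g * (1 - g)
        grad += 2.0 * (g - y) * g * (1.0 - g) * (-k)
    threshold -= lr * grad / len(data)

print(f"learned threshold ~ {threshold:.2f}")  # drifts toward ~0.6
```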
Where Simplex Stands Today
Let me be direct about what exists in Simplex v0.5.1 and what doesn't.
What We Have (Implemented)
| Feature | Description | Neural IR Analog |
|---|---|---|
| Anima | Cognitive soul with memory (episodic, semantic, procedural, working) and beliefs with confidence scores (0.0-1.0) | Confidence scores function like weights; this is the foundation for learnable parameters |
| Belief Thresholds | Three-tier system: 30% (Anima), 50% (Hive), 70% (Divine) | These are fixed thresholds, not learned. But they demonstrate the pattern of graduated activation |
| Per-Hive SLM | One shared language model per hive, used by all specialists | The SLM is a neural network with trained weights; inference is already differentiable internally |
| HiveMnemonic | Shared consciousness/memory across specialists in a hive | Embedding-based memory with vector similarity search |
| Semantic Routing | Tasks routed to specialists based on embedding similarity | Computes attention-like scores; currently takes argmax (hard selection); see the sketch after this table |
| Belief Revision | Beliefs update based on experience and contradictions | Updates confidence scores; not gradient-based, but functionally similar to weight updates |
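As promised in the Semantic Routing row, here is a small Python sketch of that pattern: cosine scores over specialists, today's hard argmax selection, and the softmax weighting a Neural IR would differentiate through. The specialist names and embeddings are invented for the example; this is not the actual Simplex routing API.

```python
import math

def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(x * x for x in b))
    return dot / (na * nb)

task_embedding = [0.9, 0.1, 0.3]
specialists = {
    "parser":  [1.0, 0.0, 0.2],
    "planner": [0.1, 0.9, 0.4],
    "critic":  [0.3, 0.2, 0.9],
}

scores = {name: cosine(task_embedding, emb) for name, emb in specialists.items()}

# Today: hard selection; no gradient flows through the choice.
chosen = max(scores, key=scores.get)

# Neural IR direction: soft selection, one weight per specialist.
total = sum(math.exp(s) for s in scores.values())
weights = {name: math.exp(s) / total for name, s in scores.items()}

print(chosen, weights)
```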
What We Don't Have (Not Yet Implemented)
| Missing Feature | Gap |
|---|---|
| Gradient computation | No automatic differentiation through Simplex code. Belief updates are heuristic, not gradient-based. |
| Neural Gate syntax | All conditionals are binary (if/else, match). No soft conditional construct exists. |
| Learnable thresholds | The 30/50/70% belief thresholds are hardcoded, not optimizable from data. |
| Differentiable IR | Simplex compiles to LLVM IR → native code. No neural graph representation exists. |
| End-to-end training | Cannot backpropagate through program execution to optimize routing, thresholds, or logic. |
Honest Assessment: How Close Are We?
The current Simplex architecture provides a conceptual foundation for Neural IR, but significant engineering work remains. Here's a realistic breakdown:
Architectural Alignment: Strong
The Anima's belief system, confidence scores, and memory architecture mirror neural patterns. The per-hive SLM is already a neural network. Semantic routing computes similarity scores. These aren't accidental—they were designed with cognitive principles in mind, and those principles map well to neural computation.
Implementation Gap: Substantial
Converting this architecture to true Neural IR requires: (1) building an automatic differentiation system, (2) designing and implementing Neural Gate syntax, (3) creating a new IR that represents programs as differentiable graphs, and (4) building training/optimization infrastructure. This is months of work, not weeks.
Path Forward: Clear
The good news: we're not starting from scratch. The semantic design is in place. The SLM inference is already differentiable (within the model). The embedding operations support gradient flow. We need to extend this differentiability to the surrounding program logic.
Core Technical Challenges
Building a true Neural IR requires solving five fundamental problems:
1. Differentiability of Control Flow
Standard if/else branches are not usefully differentiable: the gradient is zero almost everywhere and undefined at the decision boundary, so you cannot backpropagate through a hard conditional. The solution: Gumbel-Softmax and continuous relaxation.
```
// Training mode: transforms to sigmoid approximation
neural_gate should_retry(confidence: f64) -> bool {
    confidence > 0.7
}
// Compiles to: sigmoid((confidence - 0.7) * temperature)
// Temperature anneals during training

// Inference mode: snaps back to discrete
// Full performance, zero gradient overhead
```
The compiler supports dual modes: --mode=train (differentiable) and --mode=infer (discrete). This gives you gradient flow during learning and full speed in production.
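For intuition, here is a minimal Python sketch of the dual-mode behavior (Python rather than Simplex; the function signature and the annealing schedule are assumptions). In training mode the gate evaluates the sigmoid relaxation from the comment above; in inference mode it is an ordinary comparison.

```python
import math

def should_retry(confidence, threshold=0.7, temperature=1.0, mode="train"):
    if mode == "infer":
        return confidence > threshold                  # discrete, no overhead
    # sigmoid((confidence - threshold) * temperature); higher temperature
    # values push the curve toward a hard step.
    return 1.0 / (1.0 + math.exp(-(confidence - threshold) * temperature))

for temperature in (1.0, 4.0, 32.0):                   # annealed over training
    print(temperature, round(should_retry(0.75, temperature=temperature), 3))

print(should_retry(0.75, mode="infer"))                # True
```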
2. Probabilistic Formal Verification
Soft logic loses predictability. If a gate is "85% true," how do you verify correctness? The solution: Contract Logic with confidence bounds.
```
neural_gate memory_safe_path(analysis: SecurityAnalysis) -> bool
    requires analysis.confidence > 0.95   // Must exceed threshold
    ensures  result => no_buffer_overflow // Guarantee holds
    fallback => conservative_path()       // Below threshold
{
    analysis.is_safe
}
```
Contracts are checked at compile time where possible, with runtime enforcement for dynamic cases. This prevents the neural part from causing system crashes.
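A hedged Python sketch of the runtime side of this: if the `requires` bound on confidence is not met, the deterministic fallback runs instead of the neural path. The `SecurityAnalysis` shape and the 0.95 bound come from the example above; everything else is illustrative.

```python
from dataclasses import dataclass

@dataclass
class SecurityAnalysis:
    is_safe: bool
    confidence: float

def conservative_path() -> bool:
    return False                        # never claims safety without evidence

def memory_safe_path(analysis: SecurityAnalysis) -> bool:
    if analysis.confidence <= 0.95:     # `requires` not met -> fallback
        return conservative_path()
    return analysis.is_safe             # path covered by the `ensures` guarantee

print(memory_safe_path(SecurityAnalysis(is_safe=True, confidence=0.97)))  # True
print(memory_safe_path(SecurityAnalysis(is_safe=True, confidence=0.60)))  # False (fallback)
```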
3. Hardware-Aware Compilation
CPUs excel at branching; GPUs excel at tensor ops. The solution: Hybrid Targeting with automatic graph partitioning.
```
@gpu
neural_gate batch_classifier(inputs: List<Embedding>) -> List<Label> {
    // Tensor operations → GPU
}

@cpu
fn process_result(label: Label) -> Action {
    // Branching logic → CPU
}
```
The compiler analyzes the Anima graph and automatically splits execution: Hard Gates to CPU, Neural Gates to GPU/NPU, with automatic data marshalling between devices.
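Conceptually, that partitioning pass reduces to walking an annotated call graph, bucketing nodes by target device, and recording the edges that cross devices so data marshalling can be inserted. A toy Python sketch, with invented node names and a deliberately simplified graph:

```python
# Each node carries its target annotation and outgoing calls.
nodes = {
    "batch_classifier": {"target": "gpu", "calls": ["process_result"]},
    "process_result":   {"target": "cpu", "calls": []},
}

def partition(graph):
    plan, transfers = {"cpu": [], "gpu": []}, []
    for name, node in graph.items():
        plan[node["target"]].append(name)
        for callee in node["calls"]:
            if graph[callee]["target"] != node["target"]:
                transfers.append((name, callee))   # needs a device transfer
    return plan, transfers

print(partition(nodes))
# ({'cpu': ['process_result'], 'gpu': ['batch_classifier']},
#  [('batch_classifier', 'process_result')])
```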
4. Structural Pruning
Neural networks are dense; code is sparse. The solution: Dead Path Elimination at the IR level.
After training, gates with near-zero weights are identified and physically removed from the compiled binary. Pruning strategies include weight magnitude, activation frequency, and gradient flow analysis. The result: final binaries approach C/Rust footprint while retaining learned behavior.
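In pseudocode terms, weight-magnitude pruning is a filter over the learned gates: anything below a small cutoff is dropped before code generation. A small Python sketch, with invented gate names and an assumed epsilon:

```python
learned_gates = {"retry_gate": 0.83, "verbose_log_gate": 0.004, "cache_gate": 0.41}

def prune(gates, epsilon=0.01):
    """Keep gates with non-negligible learned weight; report the rest as dead paths."""
    kept = {name: w for name, w in gates.items() if abs(w) >= epsilon}
    removed = [name for name in gates if name not in kept]
    return kept, removed

kept, removed = prune(learned_gates)
print(kept)      # {'retry_gate': 0.83, 'cache_gate': 0.41}
print(removed)   # ['verbose_log_gate'] -> eliminated from the compiled binary
```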
5. Superposition Memory Model
If a gate is 50% true and 50% false, does the program allocate memory for both branches? The solution: explicit Weighted Pointer semantics.
```
// Lazy: allocate only dominant path (production)
let result = match branch_selector(x) {
    A => compute_a(),  // Only if P(A) > threshold
    B => compute_b(),
}

// Speculative: allocate all, weight, GC unused (training)
@speculative
let result = match branch_selector(x) {
    A => compute_a(),  // All paths evaluated
    B => compute_b(),  // Results weighted
}
```
A new WeightedRef<T> type tracks references with associated probabilities. The GC reclaims allocations when their weights drop below a threshold or when the superposition collapses.
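A speculative Python sketch of what this could look like; the class name mirrors the post, but the API, the collapse rule, and the 0.5 threshold are assumptions for illustration only.

```python
class WeightedRef:
    """A reference paired with the probability of its branch being taken."""
    def __init__(self, value, weight):
        self.value, self.weight = value, weight

def collapse(refs, threshold=0.5):
    """Superposition collapse: keep the dominant ref, reclaim low-weight ones."""
    winner = max(refs, key=lambda r: r.weight)
    reclaimed = [r for r in refs if r is not winner and r.weight < threshold]
    return winner, reclaimed

a = WeightedRef("compute_a() result", weight=0.85)
b = WeightedRef("compute_b() result", weight=0.15)
winner, reclaimed = collapse([a, b])
print(winner.value, len(reclaimed))   # dominant path kept, the other reclaimed
```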
Implementation Timeline
The rollout is planned across five phases:
| Phase | Target | Key Deliverables |
|---|---|---|
| Phase 1: Neural Gates | Jan 2026 | `neural_gate` keyword, Gumbel-Softmax, dual compilation modes, basic gradient tracking |
| Phase 2: Contract Logic | Feb 2026 | `requires`/`ensures`/`fallback` syntax, static analysis, runtime verification |
| Phase 3: Hardware Targeting | Feb-Mar 2026 | Graph partitioning, CPU/GPU/NPU code generation, `@cpu`/`@gpu` annotations |
| Phase 4: Structural Pruning | Mar 2026 | Weight-magnitude pruning, dead path elimination, binary size optimization |
| Phase 5: Memory Model | Mar-Apr 2026 | `WeightedRef<T>`, Lazy/Speculative/Checkpoint modes, weighted GC |
What This Means for Developers
If you're building with Simplex today:
- Your code remains valid: Neural IR is an extension, not a replacement. Existing programs compile unchanged.
- Opt-in gradually: Use Neural Gates where you want learnable behavior; keep traditional conditionals elsewhere.
- Safety guarantees: Contract Logic ensures neural components don't break system invariants.
- Performance parity: Inference mode and pruning mean production binaries match traditional compilation speed.
Track the Implementation
The full technical specification, success criteria, and research references are tracked in the Simplex repository:
Implementation Plan: the full task specification with deliverables and success criteria.
This work builds directly on Simplex's existing strengths. The Anima already thinks in confidence scores. The hive already routes based on similarity. The SLM already performs differentiable inference. Neural IR completes the picture.