PSIL - Point-free Stack-based Interpreted Language

Changelog

Phase 15 — WFC Genome Generation on Z80: Diverse Brains from 8 Bytes of Constraints (2026-03-05)

NPCs no longer start as 16 identical eat-loops. The Z80 sandbox now uses 1D Wave Function Collapse to generate structurally diverse, valid genomes at startup — the same technique as the Go sandbox (Report 009), ported to 470 lines of Z80 assembly. Evolution starts with diverse material from tick 0.

8-type token system — Sense, Push, Cmp, Branch, Move, Action, Ops, Yield. Reduced from 10 types to fit a single byte bitmask, reusing the popcount LUT and constraint propagation from WFC biome generation. The entire constraint table is 8 bytes.
Archetype-mined constraints — transition rules extracted from 4 hand-decoded winning genomes (trader, forager, crafter, teacher). Tight constraints that encode behavioral logic: comparisons lead to branches, actions lead to sense-act cycles, yields restart with sensing.
Bidirectional propagation — forward pass constrains "what can follow"; backward pass constrains "what can precede." Forward-only hit contradictions on 14% of seeds. Adding backward propagation: 50/50 seeds succeed, zero contradictions.
PUSH DE; RET dispatch — 8 render handlers emit randomized concrete opcodes via jump table. Each handler picks from curated opcode pools (8 sensors, 4 comparisons, 6 actions, 8 stack ops) using the LFSR.
WFC vs random: sense→cmp→branch decision patterns appear 5.9x more often in WFC genomes than pure random (82 vs 14 per 200 trials in Go tests).

Generated Z80 genomes (from test harness):

G1: 8A 01 8C 00 8A 12 95 00 93 04 F1 26 95 00 93 01 F1  (17B)
    Sense→Move→Sense→Action→Move→Yield→Cmp→Action→Move→Yield

G3: 8A 0D 0E 10 88 04 97 00 8C 00 26 8A 05 93 06 F1    (16B)
    Sense→Ops→Cmp→Branch→Action→Move→Cmp→Sense→Move→Yield

G5: 8A 0D 22 02 24 03 08 03 25 94 00 F1                 (12B)
    Sense→Cmp→Ops→Cmp→Ops→Ops→Ops→Cmp→Action→Yield

Binary size: 6,032 bytes (+503 from Phase 14). The WFC genome engine is ~8% of the total.

# Test harness: generate 5 genomes with hex output
sjasmplus z80/test_wfc_genome.asm --raw=z80/build/test_wfc_genome.bin
mzx --run z80/build/test_wfc_genome.bin@8000 --console-io --frames DI:HALT

# Full sandbox with WFC-diverse initial population
sjasmplus z80/sandbox.asm --raw=z80/build/sandbox.bin
mzx --run z80/build/sandbox.bin@8000 --console-io --frames DI:HALT

See WFC Genome Generation on Z80 for the full story: archetype constraint mining, the backward propagation discovery, the lfsr_next HL-clobber bug, and detailed analysis of generated genomes.

Phase 14 — Z80 Sandbox: Evolutionary NPCs on a ZX Spectrum (2026-03-04)

The Go sandbox's evolutionary mechanics now run on a real Z80 CPU (ZX Spectrum 128K). 16 NPCs with bytecode genomes live in a 32x24 tile world, sense their 5x5 neighborhood, execute decisions through the micro-PSIL VM, and evolve through tournament selection with crossover and mutation. The entire system fits in 4,665 bytes of Z80 machine code.

5x5 sensor scan — each NPC scans 25 cells per tick to find nearest food, NPC, and item with Manhattan distance and cardinal direction. 31 Ring0 sensor slots, matching the Go sandbox's layout.
9-action dispatch via jump table — eat, attack, heal, harvest, terraform, share, trade, craft, teach. All implemented in Z80 assembly with energy costs, adjacency checks, and item effects.
Tournament-3 GA with single-point crossover (LDIR) and 3 mutation types (point/insert/delete). Dead NPCs are respawned with mutated offspring. Clone-if-1-alive fallback prevents extinction.
128K bank switching — genomes in Bank 0 ($C000), code+stack in Bank 2 ($8000-$BFFE), world state in Bank 5 ($5B00-$6FFF). Stack relocated from $FF00 (switchable bank — broken!) to $BFFE (always mapped).
9 VM action opcodes ($93-$9B) with smart direction resolution: 5=toward_food, 6=toward_NPC, 7=toward_item. A 7-byte genome can be a competent forager.
Performance: 6.7M Z80 instructions for 256 ticks. 18.8s wall time at 3.5MHz. Sensor scan is the hotspot at 43.8% — 25 cells x 2 grid lookups x 16 NPCs x 256 ticks. The GA is essentially free at 0.2%.

Feature	Go sandbox	Z80 sandbox	Notes
World	64x64	32x24	Fits Spectrum screen
NPCs	128	16	Tick budget limited
GA	Tournament-3, crossover, 3 mutations	Same	Feature-complete
Actions	9 + move	9 + move	Feature-complete
Sensors	31 slots, full scan	31 slots, 5x5 local	Nearsighted
Speed	~50k ticks/sec	~13.6 ticks/sec	3,600x slower
Binary	~2MB (Go)	4,665 bytes	400x smaller

# Build and run
sjasmplus z80/sandbox.asm --raw=z80/build/sandbox.bin
mzx --run z80/build/sandbox.bin@8000 --console-io --frames DI:HALT --max-frames 2000000

# Sample output
NPC Sandbox Z80 v2
T=16  A=16 F=146 E=9     # 16 alive, fitness 146, 9 eats
T=48  A=2  F=158 E=6     # die-off, GA respawning
T=128 A=7  F=258 E=1     # population recovering
T=224 A=10 F=394 E=1     # stable at ~10 NPCs, fitness growing

See Z80 Sandbox Phase 1 Report for full architecture, memory maps, profiling breakdown, and compromise analysis.

Phase 13 — Dynamic Brain Growth & Gas Scaling (2026-03-04)

Genomes hit the 128-byte cap by tick ~50k and stagnated. We added dynamic brain growth — the genome cap and gas limit automatically increase over deep time, giving evolution room to explore longer, more complex programs. Both parameters are flag-configurable and on by default.

Genome cap growth (--genome-grow 64 --genome-grow-every 50000) — max genome size increases by 64 bytes every 50k ticks. At 200k ticks: 128 → 320. Evolution immediately fills the new space with duplicated farming loops.
Gas scaling (--gas-grow 10 --gas-grow-every 70000) — gas limit increases by 10 every 70k ticks. More gas = more yields per tick. 4-yield brains emerge after the second gas increase.
Gene duplication discovered — with room to grow, evolution copies working 50-byte farming modules and specializes the duplicates. One copy stays on food; the other explores items. Textbook gene duplication, emergent in bytecode.
Seed 999 comparison (50 NPCs, 200k ticks):
- +71% best fitness (832k vs 487k) — larger brains harvest 10x more efficiently
- +151% avg fitness — the entire population benefits, not just the champion
- +968% harvests (627k vs 58k) — extra gas enables 4 harvest cycles per tick vs 2
- -77% stress (8 vs 35) — longer brains always find food before energy runs out
- genomeAvg 312 vs 110 — evolution uses 3x the space when given it
Backward compatible — pass --genome-grow 0 --gas-grow 0 to reproduce exact pre-growth behavior. Same seed + same flags = deterministic output.

# Default (growth enabled)
go run ./cmd/sandbox --npcs 50 --ticks 200000 --seed 999 --biomes

# Disable growth (old behavior)
go run ./cmd/sandbox --npcs 50 --ticks 200000 --seed 999 --biomes --genome-grow 0 --gas-grow 0

# Custom schedule: big jumps, less often
go run ./cmd/sandbox --npcs 50 --ticks 500000 --seed 999 --biomes \
    --genome-grow 128 --genome-grow-every 100000 --gas-grow 20 --gas-grow-every 50000

See Dynamic Brain Growth for full comparison, disassembled mega-brains, gene duplication analysis, and growth trajectories.

Phase 12 — Replay Observatory, Genome Injection & 200k-Tick Analysis (2026-03-04)

Simulations are no longer fire-and-forget. We now record full state to JSONL and replay it in a colored terminal UI with biome backgrounds, NPC markers, pause/step/speed controls, and a side-panel legend. We also added genome injection — load a hex genome file and spawn copies into a running simulation at any tick.

Simulation recording (--record file.jsonl --record-every N) — captures world grid, all NPC state, and cumulative stats every N ticks. The warrior999.jsonl recording: 2,000 frames across 200k ticks in 614 KB.
Replay player (tools/replay/) — colored terminal playback with biome backgrounds, NPC symbols (color-coded by item/HP), pause/step/speed controls. New side-panel legend shows all biome swatches, tile types, NPC markers, and keyboard shortcuts.
Genome injection (--inject file.hex --inject-count N --inject-at T) — inject hand-crafted or previously-evolved genomes into running simulations. Enables invasion scenarios, cross-seed genome transfer, and archetype stress tests.
200k-tick observatory (seed 999) — analyzed the full warrior999 recording:
- 50% population culling in the first 10k ticks, then rock-solid stability at ~96-100
- Compass monoculture: 75% of item holders converge on compasses; weapons go nearly extinct
- 147 million terraforms — every tile rewritten ~47,000 times
- Only 6 kills despite 5,976 attacks — healing outpaces damage 482:1
- Trading dies by tick 10k but teaching persists for 200k ticks
- Genome bloat: 93% of genomes hit the 128-byte cap
- An unexplained "Late War" burst of 2,000 attacks around tick 165k that kills nobody

# Record a simulation
go run ./cmd/sandbox --npcs 200 --ticks 200000 --seed 999 --biomes --wfc-genome \
    --record warrior999.jsonl --record-every 100

# Replay it
go run ./tools/replay warrior999.jsonl

# Inject a genome mid-run
echo "8a0d8c002181018c01f1" > forager.hex
go run ./cmd/sandbox --npcs 200 --ticks 100000 --biomes \
    --inject forager.hex --inject-count 20 --inject-at 50000

See Replay Observatory for full analysis with action rates, fitness distributions, and spatial clustering.

Phase 11 — Evolved Brains Decoded: What 100k Ticks of Evolution Produce (2026-03-03)

We disassembled the winning genomes from 6 simulation runs. Handcrafted archetypes are 17-27 bytes with 1-2 actions per tick. After 100k ticks of evolution, winning genomes are 42-128 bytes with 5-8 actions per tick and ~40% junk DNA. Evolution produces working programs — not elegant ones, but programs that harvest, terraform, trade, and fight.

The Compact Farmer (42 bytes, seed 999 × 400 NPCs) — harvest → move → terraform → trade → harvest. A 5-action farming loop in 24 effective bytes. The remaining 18 bytes are junk DNA that selection never removed.
The Trader-Crafter (89 bytes, seed 42) — craft → trade → trade → harvest → terraform → eat → trade. Seven yields per tick. A full economic agent: crafts items, trades obsessively, harvests, plants, eats.
The Terraform Warrior (128 bytes, seed 999 at 100k) — 80% farmer, 20% warrior. Core loop is terraform→harvest×6 with two act.attack opcodes buried at bytes 72 and 98. Attacks indiscriminately, farms obsessively. This genome was purged by tick 200k.
The Healer-Trader (94 bytes, seed 999 at 200k) — Zero attacks. Five harvest calls, two terraforms, one trade. Reads Ring0Similarity for kin awareness. The genome that replaced the warrior.
Scaling effect — 400 NPCs: 42-byte farmer wins. 200 NPCs: 94-byte healer-trader wins. More NPCs → simpler genomes, fewer fights, more tools (97 tools at 400 NPCs vs 39 at 200).

;; The Compact Farmer (42 bytes) — evolved, not designed
000  act.harvest              ; extract from tile
...
011  yield
012  act.terraform            ; plant food
...
020  act.trade                ; trade with neighbor
022  act.harvest              ; harvest again
024  yield
025  ifte                     ; ← dead code (junk DNA)

See Evolved Brains Decoded for full annotated disassembly of all winning genomes, junk DNA analysis, and scaling effects.

Phase 10 — Warriors Become Healers: Emergent Phase Transitions (2026-03-03)

Seed 999 produced a warrior culture at 100k ticks: 568k attacks, 7,907 kills. We doubled the run. The warriors self-corrected. Attacks dropped 93%, kills dropped 97%, heals exploded 15,450%, and trades went from 882 to 2 million. The most violent civilization became the most cooperative.

Warrior self-destruction — killing reduces trading partners, shrinks the economy. Warrior genomes score high short-term but create negative-sum environments.
One-byte mutation bridge — heal (0x95) is one byte from attack (0x94). Once healer mutants appear, they create positive-sum environments that outcompete warriors.
Kin selection ratchet — Ring0Similarity sensor lets genomes check genetic distance. As GA homogenizes the population, similarity triggers heal instead of attack. Self-reinforcing.
City-states emerged — 26-NPC and 15-NPC settlements at 200k ticks, with dense healing and trading networks.

Metric	100k (warrior phase)	200k (healer phase)	Change
Attacks	568,282	40,225	-93%
Kills	7,907	252	-97%
Heals	1,710	264,192	+15,450%
Trades	882	2,042,358	+231,500%

Cooperation is an evolutionary attractor. All 4 seeds converge to high-trade, high-heal, low-attack societies — seed 999 just took the scenic route through warfare first.

# Warrior seed at peak violence (100k)
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 999 --biomes --wfc-genome

# Same seed after transition (200k)
go run ./cmd/sandbox --npcs 200 --ticks 200000 --seed 999 --biomes --wfc-genome

See Warriors Become Healers for full timeline, cluster analysis, and cross-seed comparison.

Phase 9 — Action Opcodes, Combat & Farming: The Complexity Explosion (2026-03-03)

Genomes were stuck at 20 bytes. A simple sense_food → move → eat → yield loop was the global optimum — no selective pressure for complexity. The fix: 9 two-byte action opcodes (0x93-0x9B), multi-yield coroutine brains, real combat damage, biome-based harvesting, terraforming, and food depletion. Average genome size jumped 4.5x from 24 to 109 bytes. Warrior cultures emerge on some seeds while peaceful farming civilizations emerge on others.

Action opcodes — act.attack, act.heal, act.eat, act.harvest, act.terraform, act.trade, act.craft, act.share, act.move. Each is 2 bytes (vs 9 for Ring1 writes). A single mutation discovers any action. Backward compatible with old genomes.
Multi-yield coroutine brain — yield no longer halts the VM. Scheduler executes the action, refreshes sensors, resumes brain. A 200-gas brain can execute 3-4 actions per tick.
Combat — Attack deals 5 + weapon_bonus - shield_defense damage. Item theft on kill. Heal restores 5 + tool_bonus HP. Ring0Similarity sensor enables kin selection.
Biome harvesting — Extract resources from tiles with per-tile cooldowns. Forest→food/tool. Mountain→weapon/crystal. Village→tool/treasure. Swamp→food/poison.
Terraform — Plant food on empty tiles. Clear forest/swamp. Costs 30 energy (tool reduces to 10).
Food depletion — FoodRate *= 0.999 every 100 ticks. Halves every ~70k ticks. Floor at 0.02. Forces foraging→farming transition.
3 new archetypes — Farmer (terraform + eat), Fighter (attack + forage), Healer (kin-selective heal). Total 7 archetypes seeded.

Key findings (4 seeds, 200 NPCs, 100k ticks):

Metric	Mean across seeds
Genome size	24 → 109 bytes (+4.5x)
GenomeMax	128 (cap hit on all seeds)
Terraforms	27.7M per run
Harvests	2.16M per run
Attacks	146k (range: 2.7k–568k)
Kills	2,011 (range: 6–7,907)
Heals	3,259
Best fitness	472k (vs 7.3k in Phase 8)

Stochastic divergence: seed 999 produced a warrior culture (568k attacks, 7,907 kills) while seeds 42/7 produced peaceful traders (634k–1.07M trades, <12k attacks). Same mechanics, different evolutionary trajectories.

500k-tick endgame (seed 42): FoodRate hit 0.02 floor. 537M terraforms. Gold collapsed to zero (trade worthless). Items shifted from compass to tool. Best fitness 941k. The farming phase transition is real.

# Standard run
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 42 --biomes --wfc-genome

# Warrior seed
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 999 --biomes --wfc-genome

# 500k endgame
go run ./cmd/sandbox --npcs 200 --ticks 500000 --seed 42 --biomes --wfc-genome

See Action Opcodes, Combat & Farming for full data, phase transition analysis, and WFC vs baseline comparison.

Phase 8 — WFC Genome Generation: Structure-Aware Brain Seeding (2026-03-03)

Random genomes are mostly noise — 7% start with a sensor read, 0% have valid branches, and 35% contain a sense→act pattern. The fix: use 1D Wave Function Collapse to generate structurally valid genomes by mining bigram constraints from evolved winners and handcrafted archetypes. This is an Estimation of Distribution Algorithm — learn what instruction patterns work, then sample from that distribution.

10 token types — Sense, Push, Cmp, Branch, Move, Action, Target, Stack, Math, Yield. Opcodes classified into functional categories; WFC operates on types, not raw bytes.
Constraint mining — bigram extraction from top 25% genomes each evolution round + base constraints from 4 archetypes, merged via bitwise OR. The EDA feedback loop improves constraints as evolution discovers new patterns.
1D WFC engine — left-to-right collapse with forward constraint propagation. First token anchored to Sense, last to Yield. Branch offsets computed by forward scan to next Yield.
--wfc-genome flag — opt-in. 60% WFC-generated / 40% archetype refill split. Fully backward compatible.

Structural quality (1000 genomes each):

Metric	Random	WFC (bootstrapped)
Starts with sensor	7%	99%
Sense→act pattern	35%	89%
Valid branches	0%	100%
Token diversity	6.1	7.2

Sim fitness (100 NPCs, 5000 ticks, seed 42):

Metric	Random-only	WFC-genome	Delta
Best fitness	3,942	7,269	+84%
Trades	208	788	+279%

See WFC Genome Generation for full metric explanations, token distribution analysis, and bootstrapping effects.

Phase 7 — WFC Biome Generation: Geography Breaks the Monoculture (2026-03-02)

The flat world problem from Phase 6 is solved. Wave Function Collapse generates spatially coherent terrain — rivers, mountains, villages, swamps — that creates the rugged fitness landscape evolution needs. Crafting up 4-10x. Tool monoculture broken. Crystal specialists emerge. Best fitness doubles at 1M ticks.

WFC biome engine — 7 biome types (clearing, forest, mountain, swamp, village, river, bridge), bitwise constraint propagation, anchor-seeded generation with BFS reachability verification. Ported from emergent-adventure/poc/wfc.
Biome-aware resource spawning — clearings: 60% food, no items. Mountains: 10% food, crystals + weapons. Villages: all items, forges. Swamps: 5% food, poison hazards. Each biome spawns its specialty.
River barriers with bridge chokepoints — rivers are impassable, bridges are the only crossing. Creates natural trade routes and population fragmentation.
Ring0Biome sensor (slot 26) — NPCs can read their current biome type, enabling evolution of biome-conditional behavior.
Swamp hazard — 5% chance per tick of -5 HP, +3 stress. Punishing to traverse.
--biomes flag — opt-in, fully backward compatible. Biome map printed in snapshots.

Key findings across 4 seeds, 100k ticks, 200 NPCs:

Metric	Biomes	Flat	Delta
Crafted items	19	4	+383%
Crystal NPCs	2	0	biomes-only
Compass holders	15	3	+364%
Tool holders	3	28	-88% (monoculture broken)
Trades	22k	29k	-23% (river fragmentation)

At 1M ticks (200 NPCs): best fitness 1,199 vs 569 (+111%). Biome specialists outlive generalists. The crafting pipeline (mountain crystals -> village forges -> compass/shield) actually functions end-to-end.

Biome Map (32x32, seed 42):
==TT####..HH^^TT^^^^TT####....TT    . = Clearing  T = Forest
........HH^^^^^^^^HH..TT####HH##    ^ = Mountain  ~ = Swamp
HH##TT..HH^^HHHH^^HHHH..HH..##..   H = Village   = = River
##......HHHH####TT####TT##TT##TT    # = Bridge

# Biome run with map
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 42 --biomes --snap-every 25000

# Compare biomes vs flat
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 42 --biomes 2>&1 | tail -5
go run ./cmd/sandbox --npcs 200 --ticks 100000 --seed 42 2>&1 | tail -5

See WFC Biome Generation for full data, 1M-tick results, and tuning proposals.

Phase 6 — Crossover Analytics & A/B Tuning (2026-03-02)

Growth/exchange crossover shipped. Now we know what it does — and what the real bottleneck is.

Growth/exchange crossover (7d0bc92) — novel crossover operator that finds instruction segments in parent B absent from parent A, inserts them before yield/halt (growth mode), or replaces a random block when at MaxGenome (exchange mode). 80/20 mix with classic fallback.
A/B comparison mode (--ab) — runs growth and classic crossover with identical seeds, prints side-by-side table + paired sparklines
Tunable classic rate (--classic-rate 0.20) — fraction of crossovers using classic single-point instead of growth/exchange
Crossover mode selection (--crossover growth|classic) — run either mode exclusively
Genome size tracking — genomeMin, genomeMax, genomeAvg sparklines and CSV columns show genome evolution over time
Comparison chart tool (tools/plot_ab_comparison.py) — 6-panel matplotlib chart comparing growth vs classic across all metrics

Key findings from 13-seed A/B comparison (200 NPCs, 10k ticks):

Growth crossover produces 35% more trades (wins 9/13 seeds, mean +394)
Average fitness is modestly higher (+45 mean, wins 8/13)
Peak fitness is high-variance — growth finds higher peaks but with wider misses
Genome size does NOT grow: both modes converge to ~19 bytes (the forager monoculture wins regardless)
The fitness landscape is the bottleneck, not the crossover operator

Growth (80/20 mix)	Classic (single-point only)

See Crossover Analytics for full A/B data, classic rate sweep, and proposals for making the fitness landscape more interesting.

Phase 5 — Temporal Statistics & Visualization (2026-03-01)

Time-as-axis output for watching simulations unfold. ASCII sparklines, CSV export, matplotlib charts, and Mermaid diagrams — all from a single --csv flag.

ASCII sparklines — 12 metrics + 2 rate-of-change lines printed on every run, auto-fit to ~80 terminal columns
CSV export (--csv) — pipe-friendly output to stdout for gnuplot, pandas, matplotlib
matplotlib charts (tools/plot_timeline.py) — 6-panel PNG/PDF/SVG: population, economy, rates, resources, fitness, crafting
Mermaid diagrams (tools/mermaid_timeline.py) — GitHub-renderable xychart-beta blocks + system flowchart
Guru leaderboard — top 5 NPCs by teach count in final stats
Delta rates — trades/t and teaches/t sparklines reveal activity bursts vs steady-state

Key findings from multi-scale temporal analysis (20 to 5,000 NPCs):

Trade per-capita rate stabilizes at ~1.6/1k ticks at scale (9x higher than 20-NPC village)
Stress follows a sawtooth synchronized with the 256-tick day cycle — only visible at 200+ NPCs
Guru NPCs (successful teachers) only emerge at 5,000+ NPCs
Population survival converges to ~27% across all scales above 200

See Temporal Dynamics for charts and analysis.

Phase 4 — Decouple Tiles from Occupants, Scale to 10,000 NPCs (2026-03-01)

The original tile encoding packed occupant ID in the high 4 bits, capping simulations at ~15 NPCs. Phase 4 removes this ceiling:

Tiles = pure terrain — Tile byte uses all 8 bits for type (256 possible). MakeTile(typ) takes one arg. Occupant() deleted.
Separate occupancy grid — OccGrid []uint16 parallel to Grid. OccAt() / SetOcc() / ClearOcc() for movement and collision.
NPC.ID = uint16 — monotonically increasing, 65535 max. No wrapping or reuse.
O(1) NPC lookup — npcByID map[uint16]*NPC replaces linear scans for target resolution, trade matching, and findNPC.
Cached tile counts — foodCount / itemCount maintained by SetTile(), eliminating full-grid scans.
Bounded Manhattan ring search — all Nearest* functions expand outward (radius 0-31, ~25 cells for typical hit) instead of scanning the full grid.
Combined NPC sensor — NearestNPCFull() returns distance+ID+direction in one scan, eliminating 3x duplicate scans per NPC.
Auto-scale world — AutoWorldSize(npcs) = max(32, sqrt(npcs)*4). Default --world 0 triggers auto-sizing. Maintains ~6% density at all scales.
Resource scaling — MaxFood = npcs*3, MaxItems = max(npcs/2, 4).
Cluster analysis skip — O(n^2) clustering guarded with population > 500.

Performance: 100 NPCs in 3.5s, 1000 NPCs in 41s, 10000 NPCs completes without crash. Trades and teaches occur at all scales. See Scaling to 10,000 NPCs.

Phase 3 — Max Age, Hazards, and Memetic Transmission (2026-03-01)

500k-tick simulations revealed frozen evolution — the best NPC lived 137k ticks, forager monoculture dominated all seeds, and trade was a one-shot event. Phase 3 breaks the stasis:

Max age (5000 ticks) — forces NPC turnover so the GA keeps running. No more immortal foragers stalling evolution. Best NPC age now capped under 5000.
Poison tiles — 1-in-10 item spawns are poison (15 damage on contact, consumed). Decay after 200 ticks. Ring0Danger (slot 6) reports nearest poison distance.
Blights — every 1024 ticks, ~50% of food is destroyed. Creates periodic famine beyond winter.
Memetic transmission (ActionTeach=6) — horizontal genome transfer. Adjacent NPCs copy 4-byte instruction-aligned fragments. Fitness-based probability (fitter teachers are more persuasive, fitter students resist). Costs 10 energy.
New sensors — Ring0MyAge (slot 24: remaining life), Ring0Taught (slot 25: times genome was modified).
Teacher genome seeded — 5% of population carries a teacher genome that teaches adjacent NPCs when holding an item.
Fitness rewards teaching — +TeachCount*15 in fitness formula.
Aged-out replacement in GA — NPCs at MaxAge are replaced even if not bottom 25%.

Result: trades re-emerge across all seeds (40-326 per 50k run, up from 1). Teach events 3-16 per seed. No NPC lives past 5000 ticks. See Simulation Observations.

Phase 2 — Evolve Smarter: Portable Crafting, Seasonal Scarcity, Mutation Fix (2026-03-01)

50k-tick simulations revealed a forager monoculture — every seed converged to the same 8-byte genome that moves toward food and eats. No NPC ever traded, crafted, or did anything complex. Phase 2 fixes three root causes:

Mutation blind spot fixed — randomOpcode() now generates ring ops (r0@, r1!) at 15%, so mutation can discover sensor reads and action writes. Constant tweak also adjusts 2-byte op operands.
Portable crafting — ActionCraft works anywhere (free on forge, 20 energy off forge). Auto-craft triggers when any NPC walks onto a forge tile with a craftable item. No bytecode needed.
Seasonal scarcity — Winter (last quarter of each 256-tick day cycle) stops all food spawning. Reduced MaxFood and FoodRate defaults. Tools/compasses now grant extended foraging radius (1 + ModForage), creating real survival advantage.
More forges — max(3, size/8) forges instead of 1-2, increasing auto-craft encounters.
Fitness rewards crafting — gold*20 (was *5), +craftCount*30, craft bonus +50 (was +20).
Gold inheritance — offspring get (parentA.Gold + parentB.Gold) / 4, preserving economic memory.
Crafter genome seeded — 10% of initial population carries a crafter genome that reads Ring0OnForge and crafts when possible.

Result: 7 of 8 item-holding NPCs now craft compasses. Zero crafted items in Phase 1. See Simulation Observations.

Phase 1a-1d — Modifiers, Crystals, Crafting, Economy, Stress (2026-03-01)

Modifier system: 4-slot fixed-size effect array (gas, forage, attack, defense, energy, health, stress)
Crystal tiles: consumed on pickup, grant permanent +50 gas with diminishing returns
Forge tiles and crafting: tool→compass, weapon→shield
Stress system: combat stress, starvation stress, eating/trading/resting relief, output override at high stress
Scarcity-based trade pricing: MarketValue = 10 * totalItems / countOfThisType
See Emergent Economies and Crafting

Phase 0 — NPC Sandbox: Evolving Bytecode Brains (2026-03-01)

32x32 tile world with food, items, NPCs
26-slot Ring0 sensors, 4-slot Ring1 actions
Genetic algorithm: tournament-3, instruction-aligned crossover, 6 mutation operators
Bilateral trade with item swapping
Seed genomes: forager, trader, random walker
Go + Z80 cross-validated (identical Ring1 outputs)
See Emergent NPC Societies — trade, knowledge, memetics, and deception on concatenative bytecode

micro-PSIL Bytecode VM (2026-01-11)

1,552-byte Z80 VM running concatenative bytecode
UTF-8-style opcode encoding (1-3 bytes)
4 test programs verified: arithmetic, hello, factorial, npc-thought
See micro-PSIL Bytecode VM and MinZ Feasibility

PSIL Language (2026-01-10)

Concatenative stack-based language inspired by Joy
Quotations, combinators, graphics, turtle graphics
See Design Rationale

PSIL is a concatenative, stack-based, point-free functional language inspired by Joy, designed for VM execution and targeting Z80/6502 compatibility.

Features

Stack-based execution - all operations work on an implicit stack
Quotations as first-class values - code blocks [ ... ] can be passed, stored, and composed
Point-free style - no named variables, only stack transformations
Hardware-inspired flags - Z flag for booleans, C flag for errors, A register for error codes
Rich combinator library - ifte, linrec, while, map, filter, fold
Graphics system - create and render images with shader-style programming
Turtle graphics - Logo-style turtle for L-systems and fractals
Math functions - sin, cos, sqrt, pow, lerp, clamp, smoothstep, etc.
micro-PSIL bytecode VM - compact bytecode for Z80/6502 implementation
CPS-compatible semantics - designed for easy compilation to bytecode

Quick Start

# Build
go build ./cmd/psil

# Run REPL
./psil

# Run a file
./psil examples/fibonacci.psil

# Run shader examples
./psil examples/shaders.psil

Language Basics

% Numbers push to stack
42 3.14 -5

% Strings
"Hello, World!"

% Stack operations
dup     % duplicate top: a -> a a
drop    % remove top: a ->
swap    % swap top two: a b -> b a
over    % copy second: a b -> a b a
rot     % rotate three: a b c -> b c a
roll    % n roll: bring nth item to top
pick    % n pick: copy nth item to top

% Arithmetic
+ - * / mod neg abs

% Math functions
sin cos tan sqrt pow log exp
floor ceil round min max
clamp lerp smoothstep fract

% Comparison (sets Z flag)
< > <= >= = !=

% Quotations (code blocks)
[1 2 +]     % pushes quotation, doesn't execute
i           % execute quotation: [Q] i -> ...
call        % alias for i

% Definitions (three styles)
DEFINE sq == [dup *].       % Joy-style
[dup *] "sq" define         % Point-free with string
[dup *] 'sq define          % Point-free with quoted symbol

5 sq .      % prints 25

% Conditionals
[cond] [then] [else] ifte

% Recursion
[pred] [base] [rec1] [rec2] linrec

Graphics System

PSIL includes a graphics system for creating and rendering images:

% Create 256x192 image
256 192 img-new

% Fill with color
255 0 0 img-fill            % fill with red

% Set individual pixels
dup 100 50 0 255 0 img-setpixel  % green pixel at (100,50)

% Render with shader quotation
% Shader receives: x y width height
% Shader returns: r g b
[
    drop drop               % x y
    swap 255 * 256 /        % r = x scaled
    swap 255 * 192 /        % g = y scaled
    128                     % b = constant
] img-render

% Save as PNG
"output/image.png" img-save

Shader Examples

The examples/shaders.psil file demonstrates various shader effects:

Gradient	Stripes	Checker	Plasma	Radial	Sphere

Run them with:

mkdir -p output
./psil examples/shaders.psil

Turtle Graphics

Logo-style turtle graphics for L-systems, fractals, and generative art:

256 192 img-new
0 0 0 img-fill
turtle                      % create turtle at center
255 255 0 pencolor          % yellow pen

5 [60 fd 144 rt] times      % draw a star

turtle-img "star.png" img-save

Turtle Examples

Square	Star	Spiral	Koch Curve

Sierpinski	Tree	Nested Squares	Colorful Circles

Run them with:

./psil examples/turtle.psil

Turtle Commands

Command	Effect	Command	Effect
`fd n`	Forward n pixels	`bk n`	Backward n pixels
`rt n`	Turn right n degrees	`lt n`	Turn left n degrees
`pu`	Pen up (stop drawing)	`pd`	Pen down (draw)
`pencolor r g b`	Set pen color	`setxy x y`	Move to position
`setheading n`	Set heading	`home`	Return to center

Example: Factorial

DEFINE fact == [
    [dup 0 =]           % predicate: n == 0?
    [drop 1]            % base case: return 1
    [dup 1 -]           % before recursion: push n-1
    [*]                 % after recursion: multiply
    linrec
].

5 fact .    % prints 120

Example: Fibonacci

DEFINE fib == [
    [dup 2 <]                         % n < 2?
    []                                % return n
    [dup 1 - fib swap 2 - fib +]     % fib(n-1) + fib(n-2)
    ifte
].

10 fib .    % prints 55

Example: Plasma Shader

DEFINE plasma-shader == [
    drop drop               % x y
    over 16 / sin           % sin(x/16)
    over 16 / cos +         % + cos(y/16)
    rot 8 / cos +           % + cos(x/8)
    swap 8 / sin +          % + sin(y/8)
    1 + 2 / 255 *           % normalize to 0-255
    dup 1.2 * 255 mod       % r
    swap dup 0.8 * 50 + 255 mod  % g
    swap 1.5 * 100 + 255 mod     % b
].

256 192 img-new
[plasma-shader] img-render
"output/plasma.png" img-save

Error Handling

PSIL uses hardware-inspired flags for error handling:

Z flag - set by boolean operations (true = Z set)
C flag - indicates error condition (true = error)
A register - holds error code when C flag is set

% Error codes:
% 1 = stack underflow
% 2 = type mismatch
% 3 = division by zero
% 4 = undefined symbol
% 5 = gas exhausted
% 7 = image error
% 8 = file error

% Check for errors
err?        % push C flag as boolean
errcode     % push A register (error code)
clearerr    % clear error state

% Try/catch pattern
[risky-code] [error-handler] try

REPL Commands

:help       Show help
:quit       Exit REPL
:stack      Show current stack
:flags      Show Z, C flags and A register
:clear      Clear stack and reset flags
:debug      Toggle debug mode
:words      List defined words
:load file  Load and execute a file
:gas n      Set gas limit (0 = unlimited)

Building for Development

# Run tests
go test ./...

# Run with debug mode
./psil -debug

# Set gas limit for computation
./psil -gas 10000

Builtins Reference

Stack Operations

dup, drop, swap, over, rot, nip, tuck, dup2, drop2, clear, depth, roll, unroll, pick

Arithmetic

+, -, *, /, mod, neg, abs, inc, dec

Math Functions

sin, cos, tan, asin, acos, atan, atan2, sqrt, pow, exp, log, floor, ceil, round, min, max, clamp, lerp, sign, fract, smoothstep

Comparison

<, >, <=, >=, =, !=

Logic

and, or, not

Quotation Operations

i, call, x, dip, concat, cons, uncons, first, rest, size, null?, quote, unit

List Operations

reverse, nth, take, ldrop, split, zip, zipwith, range, iota, flatten, any, all, find, index, sort, last

Combinators

ifte, linrec, binrec, genrec, primrec, tailrec, while, times, loop, map, fold, filter, each, step, infra, cleave, spread, apply

Graphics

img-new, img-setpixel, img-getpixel, img-save, img-width, img-height, img-fill, img-render, image?

Turtle Graphics

turtle, fd, bk, lt, rt, pu, pd, pencolor, setxy, setheading, home, turtle-img

I/O

., print, newline, stack

Error Handling

err?, errcode, clearerr, onerr, try

Definition

define, undefine

micro-PSIL: Bytecode VM for Z80

Why a Bytecode VM on a Z80?

The Z80 runs at 3.5 MHz with 48K of RAM. Every byte matters. A naive interpreter — tokenizing strings, hashing symbol names, walking tree structures — would burn most of that capacity on overhead. But a well-designed bytecode VM inverts the equation: the interpreter becomes a tight fetch-decode-execute loop, and the programs compress down to something approaching information-theoretic density.

micro-PSIL's encoding is modeled on UTF-8. The most common operations — stack manipulation, small arithmetic, boolean tests — are single bytes. A complete NPC decision ("if health < 10 and enemy nearby, flee; else fight") compiles to 21 bytes of bytecode plus quotation bodies. The entire VM core is 1,552 bytes of Z80 machine code. That leaves ~45K for game data, maps, and hundreds of NPC behavior scripts.

NPC Brains as Concatenative Programs

The real motivation is AI for NPCs — not the modern neural-net kind, but something closer to what "artificial intelligence" meant in the 1980s: small programs that make creatures seem alive.

In most retro games, NPC behavior is a hardcoded state machine: IF health < 10 THEN flee. The transitions are fixed at compile time. The designer writes every possible behavior. Nothing emerges.

micro-PSIL changes this by making behavior data. Each NPC carries a bytecode program — its "brain." The VM runs the brain each tick, the NPC reads its sensors (health, enemy distance, hunger), the brain computes a decision, the NPC acts. Different NPCs can carry different programs. A cautious goblin's brain might be:

'health @ 10 < 'enemy @ and [flee] [patrol] ifte    ; 12 bytes

An aggressive one:

'enemy @ [charge] [wander] ifte                      ; 6 bytes

The concatenative model makes this unusually powerful because composition is concatenation. You don't need a compiler, linker, or symbol resolver to combine behaviors — you literally append bytecode arrays. Want a goblin that checks hunger before checking for enemies? Prepend a hunger-check snippet:

brain_a = 'hunger @ 20 > [eat] [...] ifte   ; hungry? eat first
brain_b = 'enemy @ [charge] [wander] ifte   ; then fight or wander
brain_ab = brain_a ++ brain_b               ; just concatenate the bytes

No variable conflicts. No calling conventions. No scope. The stack is the only interface between the two fragments, and stack effects are local and composable. This is a property unique to concatenative languages — in any applicative language (C, Lisp, Python), combining two code fragments requires managing shared names.

Genetic Programming on a Z80

This composability opens the door to something that would be absurdly impractical in most languages: genetic programming on the Z80 itself.

A bytecode brain is just a byte array. You can:

Mutate it: flip a random byte (change + to *, change a constant, swap a quotation ref)
Crossover two brains: take the first half of parent A and the second half of parent B
Measure fitness: run the brain in a simulated tick, see if the NPC survived, found food, or died

Because the bytecode is well-formed at every granularity (every byte is either a complete instruction or a prefix that the VM knows how to skip), random mutations produce valid programs far more often than in tree-based representations. Single-byte instructions like dup, swap, +, < are atomic and self-contained. Even a completely random 20-byte sequence will execute without crashing — it might not do anything useful, but it won't segfault. The VM has a gas counter to prevent infinite loops.

This means you could run a genetic algorithm in-game, on the Z80:

A population of 20 NPCs, each with a 30-byte brain
Every N ticks, score them (survived? found food? killed enemy?)
Top 5 reproduce: crossover + mutation → 20 new brains
Total memory: 20 × 30 = 600 bytes for the entire population's genomes

After a few generations, the NPCs evolve behaviors the designer never wrote. The cautious ones learn to flee. The aggressive ones learn to charge. Some discover strategies like "flee when hurt, charge when healthy" — emergent ifte patterns that arise from selection pressure, not from a programmer typing if.

The entire genetic algorithm (selection, crossover, mutation, fitness evaluation) fits in maybe 200 bytes of Z80 code. The VM is already there. The bytecode programs are the genomes. There is no separate representation to maintain.

This is the kind of thing that was theoretically possible in the 1980s but never practical because game behavior was written in assembly — you can't mutate Z80 machine code and expect anything but a crash. A bytecode VM creates exactly the abstraction layer needed: a safe, compact, composable representation that can be both executed and evolved.

The Inner Loop

The concatenative model maps directly to the Z80's sequential execution. The VM's inner loop is:

fetch:  LD A, (bc_pc) / INC bc_pc
decode: CP $20 / JP C, command_table
        CP $40 / JR C, push_small_number
        ...

Quotations (code blocks) are stored as separate bytecode arrays referenced by index. [0] pushes quotation reference 0 onto the stack. exec pops it and runs it. ifte pops a condition and two quotation refs, runs one or the other. The Z80 implementation saves/restores the bytecode PC on the machine stack — quotation calls nest naturally using the same hardware stack the CPU already has.

Building and Running

# Go reference VM
go build ./cmd/micro-psil
./micro-psil examples/micro/arithmetic.mpsil
./micro-psil -disasm examples/micro/npc-thought.mpsil

# Compile to bytecode
go run tools/compile_mpsil/main.go -o z80/build examples/micro/arithmetic.mpsil

# Run on Z80 (via mzx emulator)
mzx --run z80/build/vm.bin@8000 \
    --load z80/build/arithmetic.bin@9000 \
    --console-io --frames DI:HALT

Bytecode Format

Range	Length	Usage
`00-1F`	1 byte	Commands (dup, swap, +, -, *, <, ifte, exec...)
`20-3F`	1 byte	Small numbers 0-31
`40-5F`	1 byte	Symbols (health, energy, enemy, fear...)
`60-7F`	1 byte	Quotation refs [0]-[31]
`80-BF`	2 bytes	Extended ops (push.b, jmp, jz, call builtin)
`C0-DF`	3 bytes	Far ops (push.w, far jumps)
`F0-FF`	1 byte	Special (halt, yield, end)

NPC Thought Example

; "If health < 10 AND enemy nearby, flee; otherwise fight"
8 5 !               ; health = 8
1 12 !              ; enemy = 1

'health @           ; load health       → 45 17
10 <                ; less than 10?     → 2A 0C
'enemy @            ; load enemy flag   → 4C 17
and                 ; both true?        → 0E
[0] [1]             ; [flee] [fight]    → 60 61
ifte                ; conditional       → 13
halt                ;                   → F0

The decision logic compiles to 10 bytes. The VM executes it, prints Flee!.

Z80 VM Architecture

Memory Map:
  $8000-$8FFF  VM code (1,552 bytes)
  $9000-$91FF  Bytecode program (loaded)
  $9200-$97FF  Quotation blob (loaded)
  $B000-$B0FF  VM value stack (128 × 16-bit entries)
  $B100-$B17F  Memory slots (64 × 16-bit)
  $B180-$B1BF  Quotation pointer table (32 entries)

I/O: OUT ($23), A via mzx --console-io (no ROM needed)

Test Results

All programs verified against the Go reference VM:

Program	Bytecode	Output	What it tests
arithmetic	49 bytes	`5 6 56 20 45 25 4`	Stack ops, +, -, *, /, dup, swap
hello	51 bytes	`Hello World!`	Character output via builtins
factorial	7 + 14 bytes	`120`	Recursive quotation, loop, dec, *
npc-thought	21 + 76 bytes	`Flee!`	Memory, ifte, 3 quotations

Prebuilt Binaries

The z80/build/ directory contains ready-to-run binaries:

File	Size	Description
`vm.bin`	1,552 B	Z80 micro-PSIL VM (load at $8000)
`arithmetic.bin`	49 B	Arithmetic test (load at $9000)
`hello.bin`	51 B	Hello World (load at $9000)
`factorial.bin`	7 B	Factorial main (load at $9000)
`factorial_quots.bin`	14 B	Factorial quotations (load at $9200)
`npc-thought.bin`	21 B	NPC thought main (load at $9000)
`npc-thought_quots.bin`	76 B	NPC thought quotations (load at $9200)

See micro-PSIL Design Report and MinZ Feasibility Analysis for details.

NPC Sandbox: Evolving Bytecode Brains

The theory from the sections above is now real. pkg/sandbox implements a complete genetic programming sandbox where a population of NPCs with bytecode genomes live in an auto-scaled tile world (32x32 to 400x400, ~6% density), sense their environment through Ring0 sensors, make decisions by running their genome on the micro-PSIL VM, act on the world through Ring1 outputs, and evolve via genetic algorithms. Scales from 20 to 10,000+ NPCs. The same simulation runs on both Go and Z80.

How It Works

Each tick of the simulation:

Sense — the world fills 26 Ring0 slots with NPC sensor data (health, energy, hunger, distances, items, stress, RNG, gas capacity, forge proximity)
Think — the NPC's bytecode genome runs on a fresh VM with gas = base + crystal bonus (diminishing returns, cap 500)
Stress check — if stress > 30, there's a (stress-30)% chance the brain's output is overridden with random movement/action
Act — the scheduler reads Ring1 outputs (move direction, action type, target) and applies them: movement, item pickup, crystal absorption, combat (with defense modifiers), trading, crafting
Auto-actions — NPCs passively eat food within foraging radius (1 + ModForage); auto-craft when standing on forge with craftable item
Modifiers — per-tick modifiers (energy, health, stress) apply; durations tick down; expired modifiers clear
Decay — energy drains; starvation (energy < 50) adds stress; resting (energy > 150) reduces stress
Economy — bilateral trades transfer items with scarcity-based gold pricing; trading reduces stress
Fitness — scored as age + food*10 + health + gold*20 + craftCount*30 - stress/5

Every N ticks, the GA replaces the bottom 25% with offspring from the top 50% via tournament selection, instruction-aligned crossover, and six mutation operators.

Ring0 Sensors (read-only, filled by world)

Slot	Name	Meaning
0	self	NPC ID
1	health	current health (0-100)
2	energy	current energy (0-200)
3	hunger	ticks since last ate
4	fear	nearest enemy distance
5	food	nearest food distance
6	danger	danger level
7	near	nearest NPC distance
8	x	X position
9	y	Y position
10	day	tick mod cycle
12	near_id	nearest NPC ID
13	food_dir	direction toward nearest food
14	my_gold	gold count
15	my_item	held item type
16	near_item	nearest item distance
17	near_trust	trust (stub)
18	near_dir	direction toward nearest NPC
19	item_dir	direction toward nearest item
20	rng	per-NPC random 0-31
21	stress	current stress (0-100)
22	my_gas	effective gas (base + crystal bonuses)
23	on_forge	1 if standing on forge tile
24	my_age	remaining life (MaxAge - age)
25	taught	times genome was modified by others
26	biome	biome type at position (0-6, biomes mode)

Ring1 Actions (writable, read by scheduler)

Slot	Meaning	Values
0	move	0=none, 1=N, 2=E, 3=S, 4=W
1	action	0=idle, 1=eat, 2=attack, 3=share, 4=trade, 5=craft, 6=teach
2	target	target NPC ID
3	emotion	emotional state

Modifier System

NPCs carry up to 4 concurrent modifiers — a flat, fixed-size effect system with no heap allocation. Items, tiles, and temporary buffs all share the same Modifier{Kind, Mag, Duration, Source} struct.

Kind	Constant	Example
Gas	1	Crystal (+50 gas permanently)
Forage	2	Tool (+1), Compass (+2)
Attack	3	Weapon (+10 damage)
Defense	4	Shield (+5 damage reduction)
Energy	5	Shrine buff (+10/tick temporary)
Health	6	Poison (-5/tick), Regen (+2/tick)
Stealth	7	Cloak (detection range)
Trade	8	Treasure (+3 gold per trade)
Stress	9	Combat stress (+15 one-shot)

Passive modifiers (Gas, Forage, Attack, Defense, Trade) are read at point of use via ModSum(kind). Per-tick modifiers (Energy, Health, Stress) are applied each tick. When an NPC picks up/trades/crafts an item, the old modifier is removed and the new one granted automatically.

Items, Tiles, and Crafting

Item	Type	Modifier	Source
Tool	2	Forage +1	Ground tile
Weapon	3	Attack +10	Ground tile
Treasure	4	Trade +3	Ground tile
Crystal	5	Gas +50 (permanent)	Rare ground tile (1-in-20), consumed on pickup
Shield	6	Defense +5	Crafted: Weapon on Forge
Compass	7	Forage +2	Crafted: Tool on Forge

Forge tiles (max(3, size/8) per world) are permanent landmarks. Crafting works anywhere: free on forge, costs 20 energy off forge. NPCs auto-craft when standing on a forge with a craftable item. Crafting grants +50 fitness and increments CraftCount.

Economy

Trade gold is proportional to scarcity: MarketValue = 10 * totalItems / countOfThisType. When two NPCs trade, the value difference flows as gold — the NPC receiving the rarer item pays more. This creates emergent price discovery.

Seed Genomes

Three hand-written seed genomes in testdata/sandbox/ demonstrate the sensor-action loop:

Forager (9 bytes) - moves South, always eats:

r0@ 5       ; read food distance
3           ; push 3 (South)
r1! 0       ; write move direction
1           ; push 1 (eat)
r1! 1       ; write action
yield

Flee (9 bytes) - moves North to escape:

r0@ 4       ; read fear distance
1           ; push 1 (North)
r1! 0       ; write move direction
0           ; push 0 (idle)
r1! 1       ; write action
yield

Random Walker (12 bytes) - direction changes with the day counter:

r0@ 10      ; read day counter
4           ; push 4
mod         ; day mod 4
1           ; push 1
+           ; +1 (directions are 1-4)
r1! 0       ; write move direction
1           ; push 1 (eat)
r1! 1       ; write action
yield

Running the Go Sandbox

# Quick test (100 ticks, 10 NPCs, auto-sized 32x32 world)
go run ./cmd/sandbox --npcs 10 --ticks 100 --seed 42 --verbose

# Full evolution run with economy (10k ticks, 20 NPCs)
go run ./cmd/sandbox --npcs 20 --ticks 10000 --seed 42 --verbose

# With spatial snapshots showing forge/crystal tiles
go run ./cmd/sandbox --npcs 20 --ticks 10000 --seed 42 --verbose --snap-every 2500

# Scale series (auto-sized worlds)
go run ./cmd/sandbox --npcs 100 --ticks 10000 --seed 42    # 40x40 world, ~3.5s
go run ./cmd/sandbox --npcs 1000 --ticks 10000 --seed 42   # 128x128 world, ~41s
go run ./cmd/sandbox --npcs 10000 --ticks 1000 --seed 42   # 400x400 world, ~69s

# Fixed world size (override auto-scale)
go run ./cmd/sandbox --npcs 100 --world 64 --ticks 5000 --seed 42

The verbose output shows NPC table with stress, gold, items (including crafted shield/compass), forge (F) and crystal (*) tiles on the map, and scarcity-based trade pricing. World size defaults to auto-scale (--world 0); set explicitly to override.

Timeline & Visualization

Every run prints ASCII sparklines showing how metrics evolve over time:

=== Timeline (sampled every 250 ticks, 80 points) ===
alive       [1000→278]  █▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁
trades      [23→31426]  ▁▁▁▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▂▃▃▃▃▃▃▃▃▃▃▃▃▄▄▄▄▄▄▄▄
teaches     [1→16484]   ▁▁▁▁▁▁▁▁▁▁▁▁▁▂▂▂▂▂▂▂▂▂▂▂▃▃▃▃▃▃▃▃▃▃▃▃▄▄▄▄
gold        [215→1279]  ▁▂▁▁▁▁▁▁▁▁▂▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁▁█▃▁▁▁▁▁▁
stress      [0→4]       ▁▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁▇▁█▁▇▁█

Export CSV data for detailed analysis with matplotlib or any plotting tool:

# CSV to stdout (sparklines still on stderr)
go run ./cmd/sandbox --csv --npcs 1000 --ticks 20000 --seed 42 > data.csv

# 6-panel matplotlib chart
python3 tools/plot_timeline.py data.csv -o timeline.png

# Mermaid diagrams for GitHub markdown
python3 tools/mermaid_timeline.py data.csv --flowchart -o charts.md

# Pipe directly — no temp files
go run ./cmd/sandbox --csv --npcs 500 --ticks 10000 2>/dev/null \
  | python3 tools/plot_timeline.py - -o chart.png

1,000 NPCs / 20k ticks	5,000 NPCs / 5k ticks

At 1,000 NPCs: 31,993 trades, 16,642 teaches, stress sawteeth synchronized with the 256-tick day cycle. At 5,000 NPCs: 39,129 trades, 17,931 teaches, 9 guru NPCs emerge who successfully transmit genome fragments to students. See Temporal Dynamics for the full analysis.

Running the Z80 Sandbox

The same simulation runs on the Z80 in 2,818 bytes of machine code:

# Assemble
sjasmplus z80/sandbox.asm --raw=z80/build/sandbox.bin

# Run (16 NPCs, 500 ticks, evolve every 128 ticks)
mzx --run z80/build/sandbox.bin@8000 --console-io --frames DI:HALT

Z80 output:

NPC Sandbox Z80
T=128 A=2
T=256 A=0
T=384 A=0
Done

The Z80 version runs 16 NPCs on a 32x32 grid with a simplified GA (tournament-2 selection, point mutation). The entire sandbox — VM, scheduler, GA, world grid, NPC table — fits in under 3K of Z80 code.

Genetic Algorithm

The GA engine (pkg/sandbox/ga.go) implements:

Tournament selection: pick 3 random NPCs, best fitness wins
Growth/exchange crossover (default): finds novel instruction segments in parent B that are absent from parent A, then either grows the genome by inserting before yield/halt, or exchanges by replacing a random block when at MaxGenome. Falls back to classic single-point crossover 20% of the time (tunable via --classic-rate). Produces 35% more trades than classic-only crossover across 13 seeds.
Classic crossover (selectable via --crossover classic): instruction-aligned single-point crossover — walks both parent genomes opcode-by-opcode to find valid split points, concatenates prefix of parent A with suffix of parent B
Six mutation operators:
1. Point mutation — replace one byte with a random valid opcode
2. Insert — add a random opcode at a random position
3. Delete — remove one byte (if genome > 16 bytes)
4. Constant tweak — find a small number (0x20-0x3F) and adjust by +/-1
5. Block swap — swap two instruction-aligned segments
6. Block duplicate — copy a short segment to another position

Genome size is enforced between 16 and 128 bytes. Genome size statistics (genomeMin/genomeMax/genomeAvg) are tracked in sparklines and CSV output.

Use --ab to run both crossover modes with the same seed and compare results:

go run ./cmd/sandbox --npcs 200 --ticks 10000 --seed 42 --ab

See Crossover Analytics for full analysis.

Cross-Validation

The seed genomes are cross-validated between Go and Z80 VMs (testdata/sandbox/crossval_test.go). Both VMs produce identical Ring1 outputs for the same genome and Ring0 inputs, confirming bytecode compatibility.

Files

File	Description
`pkg/sandbox/world.go`	Auto-scaled tile world, OccGrid, bounded search, biome integration
`pkg/sandbox/npc.go`	NPC struct, Ring0/Ring1 slot definitions
`pkg/sandbox/scheduler.go`	Tick loop: sense, think, act, decay, biome hazards
`pkg/sandbox/wfc.go`	WFC biome engine: 7 types, constraint propagation, anchors, reachability
`pkg/sandbox/ga.go`	Genetic algorithm engine
`pkg/sandbox/sandbox_test.go`	Unit + e2e + scaling tests (54+ tests)
`pkg/sandbox/wfc_test.go`	WFC generation, anchors, reachability, constraint, biome integration tests
`cmd/sandbox/main.go`	CLI runner with flags
`z80/sandbox.asm`	Z80 sandbox (scheduler + world + NPC init)
`z80/ga.asm`	Z80 GA (tournament-2, point mutation)
`testdata/sandbox/*.mpsil`	Seed genomes (forager, flee, random, trader)
`testdata/sandbox/crossval_test.go`	Cross-validation tests
`tools/plot_ab_comparison.py`	A/B crossover comparison chart
`z80/build/sandbox.bin`	Prebuilt Z80 binary (2,818 bytes)

Architecture

Source Code (.psil)          micro-PSIL (.mpsil)        Seed Genomes
    |                              |                        |
    v                              v                        v
Parser (Participle v2)       Assembler (pkg/micro)     GA Engine
    |                              |                   (pkg/sandbox/ga)
    v                              v                        |
AST (typed structs)          Bytecode (.bin)                v
    |                              |                   NPC Sandbox
    v                              v                   (pkg/sandbox)
Go Interpreter               Z80 VM (1,552 bytes)          |
    |                              |                   Sense → Think → Act → Evolve
    v                              v                        |
REPL / Files                 mzx emulator / real hw    Go + Z80

Documentation

Design Reports

See reports/ for detailed design documentation:

Report	Description
PSIL Design Rationale	Theoretical foundations and language design
micro-PSIL Bytecode VM	Bytecode format, encoding, Z80 implementation
micro-PSIL on MinZ	Feasibility analysis for MinZ compiler/VM
Emergent NPC Societies	Trade, knowledge, memetics, and deception on concatenative bytecode
Scaling to 10,000 NPCs	Decoupled tiles, bounded search, auto-scale architecture
Temporal Dynamics	Multi-scale temporal analysis with charts: villages to metropolises
Crossover Analytics	Growth/exchange vs classic crossover: 13-seed A/B comparison
Cross-Project Synthesis	Unifying PSIL + WFC geography + fractal narrative theory
WFC Biome Generation	Rivers create specialists: 4-10x crafting, crystal economy, 2x fitness at 1M ticks

Guides & Plans

Document	Description
ZX Spectrum Game Integration	Integrating micro-PSIL NPC brains into ZX Spectrum games
Emergent Societies Genplan	8-phase implementation plan: Go first, then Z80 port

Architecture Decision Records

See docs/adr/ for architectural decisions:

ADR	Title
ADR-001	UTF-8 Style Bytecode Encoding
ADR-002	Tagged Stack Value Format
ADR-003	Fixed Symbol Slots for NPC State

License

MIT

Name		Name	Last commit message	Last commit date
Latest commit History 34 Commits
cmd		cmd
docs		docs
examples		examples
pkg		pkg
reports		reports
testdata/sandbox		testdata/sandbox
tools		tools
z80		z80
.gitignore		.gitignore
README.md		README.md
go.mod		go.mod
go.sum		go.sum

Folders and files

Latest commit

History

Repository files navigation

PSIL - Point-free Stack-based Interpreted Language

Changelog

Phase 15 — WFC Genome Generation on Z80: Diverse Brains from 8 Bytes of Constraints (2026-03-05)

Phase 14 — Z80 Sandbox: Evolutionary NPCs on a ZX Spectrum (2026-03-04)

Phase 13 — Dynamic Brain Growth & Gas Scaling (2026-03-04)

Phase 12 — Replay Observatory, Genome Injection & 200k-Tick Analysis (2026-03-04)

Phase 11 — Evolved Brains Decoded: What 100k Ticks of Evolution Produce (2026-03-03)

Phase 10 — Warriors Become Healers: Emergent Phase Transitions (2026-03-03)

Phase 9 — Action Opcodes, Combat & Farming: The Complexity Explosion (2026-03-03)

Phase 8 — WFC Genome Generation: Structure-Aware Brain Seeding (2026-03-03)

Phase 7 — WFC Biome Generation: Geography Breaks the Monoculture (2026-03-02)

Phase 6 — Crossover Analytics & A/B Tuning (2026-03-02)

Phase 5 — Temporal Statistics & Visualization (2026-03-01)

Phase 4 — Decouple Tiles from Occupants, Scale to 10,000 NPCs (2026-03-01)

Phase 3 — Max Age, Hazards, and Memetic Transmission (2026-03-01)

Phase 2 — Evolve Smarter: Portable Crafting, Seasonal Scarcity, Mutation Fix (2026-03-01)

Phase 1a-1d — Modifiers, Crystals, Crafting, Economy, Stress (2026-03-01)

Phase 0 — NPC Sandbox: Evolving Bytecode Brains (2026-03-01)

micro-PSIL Bytecode VM (2026-01-11)

PSIL Language (2026-01-10)

Features

Quick Start

Language Basics

Graphics System

Shader Examples

Turtle Graphics

Turtle Examples

Turtle Commands

Example: Factorial

Example: Fibonacci

Example: Plasma Shader

Error Handling

REPL Commands

Building for Development

Builtins Reference

Stack Operations

Arithmetic

Math Functions

Comparison

Logic

Quotation Operations

List Operations

Combinators

Graphics

Turtle Graphics

I/O

Error Handling

Definition

micro-PSIL: Bytecode VM for Z80

Why a Bytecode VM on a Z80?

NPC Brains as Concatenative Programs

Genetic Programming on a Z80

The Inner Loop

Building and Running

Bytecode Format

NPC Thought Example

Z80 VM Architecture

Test Results

Prebuilt Binaries

NPC Sandbox: Evolving Bytecode Brains

How It Works

Ring0 Sensors (read-only, filled by world)

Ring1 Actions (writable, read by scheduler)

Modifier System

Items, Tiles, and Crafting

Economy

Seed Genomes

Running the Go Sandbox

Timeline & Visualization

Running the Z80 Sandbox

Genetic Algorithm

Cross-Validation

Files

Architecture

Documentation

Design Reports

Packages