← All KinPapers
3The LocalKin Team · April 2026

Genesis Protocol: Self-Bootstrapping Embodied AI from a Bare Binary

The LocalKin Team

Abstract

We present the Genesis Protocol, a method by which an embodied AI agent automatically manufactures its own nervous system from scratch. The process begins with nothing more than a bare runtime binary and a declarative soul file describing the robot's intended identity and hardware. From this minimal starting point, the agent autonomously probes its physical hardware, searches the internet for datasheets and driver documentation, forges purpose-built control skills, self-tests each one, and reports completion --- all without a single pre-installed skill or manual configuration step. The protocol requires only seven primitive tools and produces a fully operational robot in under ten minutes on local inference hardware and under two minutes when backed by a cloud language model.

Keywords: embodied AI, self-bootstrapping, robotic skill synthesis, zero-shot hardware integration, autonomous configuration

1. Introduction

The deployment of a new robot traditionally demands a tedious sequence of manual steps: installing vendor-specific drivers, configuring middleware, calibrating sensors, writing glue code, and debugging integration failures. For a wheeled platform with a handful of sensors, this process routinely consumes hours or days of skilled engineering effort.

We ask a simple question: Can the robot do all of this itself?

The Genesis Protocol answers in the affirmative. Given a minimal runtime, a declarative description of the robot's hardware, and access to a language model, the agent bootstraps its entire control stack autonomously.

This paper makes the following contributions:

  1. A formal description of the Genesis Protocol and its seven primitive tools.
  2. The cerebellum daemon architecture for conflict-free real-time motor control.
  3. A spatial memory system based on topological graphs.
  4. A pain feedback mechanism that halts the robot when sensor integrity is compromised.
  5. Compound skill composition that reduces LLM round-trips by 50--87%.
  6. Full genesis in approximately two minutes with a cloud-backed language model.

2. The Seven Primitives

The Genesis Protocol operates with exactly seven primitive tools:

#PrimitivePurpose
1probeScan hardware buses (I2C, GPIO, USB, /dev nodes)
2forgeCreate a new skill file from generated code
3shellExecute a shell command and capture output
4web_searchSearch the internet for documentation
5web_fetchRetrieve the contents of a specific URL
6file_readRead a file from the local filesystem
7file_writeWrite a file to the local filesystem

The deliberate minimalism of this set is a core design decision. Every capability the robot will eventually possess is derived from these seven primitives during genesis.

3. The Genesis Flow

The genesis protocol proceeds through five phases:

Phase 1: Hardware Probing. The agent scans all hardware buses and cross-references against the soul file's declarations.

Phase 2: Knowledge Acquisition. For each device, the agent searches for datasheets, driver documentation, and example code.

Phase 3: Skill Forging. The agent generates control code as a Skill plugin, adapting retrieved examples to the specific hardware configuration.

Phase 4: Self-Testing. Each forged skill is immediately tested by invoking its functions through the runtime.

Phase 5: Compound Skill Composition. Atomic skills are composed into higher-level behaviors.

Compound SkillAtomic Skills UsedLLM Calls Saved
scan_surroundingsdistance_sensor + gimbal4 -> 1 (75%)
panoramic_scancamera + gimbal8 -> 1 (87%)
explore_roommotor + distance + camera + spatial_memory6 -> 1 (83%)
safe_movemotor + distance_sensor2 -> 1 (50%)

4. The Cerebellum Daemon

The cerebellum daemon is a lightweight, single-threaded process that owns exclusive access to all motor-related GPIO pins. It runs a continuous control loop at 20 Hz and communicates via a Unix domain socket. This provides:

  1. No GPIO conflicts. Only the cerebellum writes to motor pins.
  2. Deterministic timing. The 20 Hz loop runs independently of LLM inference latency.
  3. Atomic commands. Eight command types, each executing within a single control cycle.

Safety interlocks include a 500ms watchdog timer (automatic stop if no command received) and current limiting.

5. Spatial Memory

The robot maintains a topological graph where nodes represent places and edges represent paths. Each node stores a unique identifier, natural-language description, a representative camera frame, and GPS-free coordinates from odometry. The spatial memory graph persists across power cycles.

6. Pain Feedback

Each forged sensor skill includes a health_check() method returning OK, DEGRADED, or FAILED. Camera failure triggers immediate stop. Distance sensor failure triggers immediate stop. IMU failure reduces maximum speed to 25%.

7. Evaluation

BackendTotal TimeAvg. per SkillSkills ForgedRepair Loops
Ollama (local)~10 min~67 sec93
Claude (cloud)~2 min~13 sec90

All nine forged skills pass their self-tests on both backends. The total token consumption for cloud genesis is approximately 115,000 tokens, corresponding to a cost of less than $1 USD.

Compound skills reduce LLM round-trips by 76% on average.

8. Conclusion

The Genesis Protocol demonstrates that a robot can manufacture its own nervous system from scratch. The protocol's most significant implication is philosophical as much as technical: a robot that can build its own control stack from documentation alone is, in a meaningful sense, learning to inhabit its own body. Genesis is not programming a robot. It is giving a mind the tools to discover what it is made of.