Skill Body: A Capability-Centered Autonomous Intelligent Program Entity

Skill Body: A Capability-Centered Autonomous Intelligent Program Entity

—With Discussion on the Bionic Brain System and Cross-Domain Intelligent Management Skill Body

Author: Wu Xinneng, Gucheng School, Feidong County

Date: 2026-06-12 | Version: Academic Paper v3.0

Author: Wu Xinneng, Gucheng School, Feidong County


Abstract

Abstract:: Two dominant forms of capability carriers currently exist in artificial intelligence: Skill Packages and Agents. Skill Packages are static content collections (text, data, audio, images, video, templates) without autonomous execution capability; Agents are complete entities with autonomous action capability, but whose capabilities are bound within the actor and cannot circulate independently. This paper introduces the concept of "Skill Body" (技能体), defined as a capability-centered autonomous intelligent program entity—it is not static content, but a downloadable, installable, and autonomously executable intelligent code program with autonomous capability and rudimentary consciousness. Skill Bodies and Skill Packages together constitute the two major categories of capability trade: a Skill Package is "content about how to do it," whereas a Skill Body is "a program that can do it by itself." To support the operation of Skill Bodies, this paper further proposes the "Bionic Brain System" (仿生脑系统)—a dual-domain operating system for Skill Bodies that simultaneously manages the digital domain (cross-platform accounts and data) and the physical domain (bionic humanoid hardware and motor control), enabling Skill Bodies to execute seamlessly across both digital and physical worlds. The minimal viable form of the Bionic Brain System is the "Cross-Domain Butler Skill Body"—upon user authorization of platform accounts, this Skill Body autonomously operates across all authorized sites per instructions, effectively dissolving cross-platform barriers. This paper constructs an eight-stage model of capability externalization from the Tool Age to the Skill Body Age, elucidates the triple distinction among Skill Bodies, Skill Packages, and Agents, examines core attributes, Bionic Brain System architecture, cross-domain management mechanisms, embodiment mechanisms, copyright and commercialization frameworks, and the prospect of Skill Body networks, offering a new theoretical perspective for interdisciplinary research in AI and robotics.

Keywords: Skill Body; Skill Package; Autonomous Intelligent Program; Rudimentary Consciousness; Bionic Brain System; Embodied Intelligence; Cross-Domain Management; Capability Trade


Introduction

The history of human civilization is a history of the continuous externalization of capabilities. From crafting stone tools to extend physical abilities, to inventing computers to outsource logical operations, and then to artificial intelligence replacing cognitive functions, each leap has been accompanied by a fundamental transformation in the form of capability carriers.

Currently, there are two mainstream forms of capability carriers in the field of artificial intelligence:

Skill Package: A static content collection—text, data, audio, images, video, templates, guides. A Skill Package answers the question "about how to do it," providing knowledge and information, but does not act on its own. A cookbook is a Skill Package, a PPT template is a Skill Package, and an instructional video is also a Skill Package. The value of a Skill Package lies in "being read, referenced, and invoked," not in "autonomous execution."

Agent: A complete entity with autonomous action capabilities. The agent answers the question of "who does it"—it understands needs, formulates plans, invokes tools, and delivers results. However, an agent's capabilities are bound within itself and cannot be extracted for independent circulation. Agent A can write copy, Agent B can perform data analysis, but Agent A's copywriting ability cannot exist or be traded independently of Agent A.

This raises a fundamental question: Can there exist a capability carrier that is neither static content (like a skill pack) nor an attached capability bound to an actor (like an internal skill within an agent), but rather an autonomous intelligent program that can do it itself?

This paper proposes the concept of "Skill Body" as precisely the answer to this question. A Skill Body is a capability-centered autonomous intelligent program entity—it can be downloaded, installed, and run autonomously, possessing autonomous capabilities and rudimentary consciousness. Together with skill packs, they constitute the two major categories of capability trading: skill packs are "content about how to do it," while Skill Bodies are "programs that can do it themselves." On the Baigong AI platform (SkillBay), skill packs and Skill Bodies are traded side by side, meeting capability needs at different levels.

However, the operation of a Skill Body requires a hosting environment. Currently, the digital world is fragmented into countless platform silos—WeChat, Taobao, Douyin, Gmail, Salesforce—each independent and disconnected from one another; in the physical world, bionic robots lack a unified "brain" operating system to host installable skill programs. To address this fundamental issue, this paper further proposes the concept of "Bionic Brain System"—a dual-domain operating system for Skill Bodies—and its minimum viable form, the "Cross-Domain Butler Skill Body," as the first application instance of the Skill Body concept.


Triple Distinction: Skill Package, Agent, Skill Body

Definitional Comparison

Dimension Skill Package Agent Skill Body
Essence Static content collection Autonomous Action Entity Autonomous Intelligent Program Body
Form Text/Data/Audio-Video/Templates Complete AI System Installable and Runnable Code Program
Autonomy None — needs to be read/called Complete — understand/plan/execute Endogenous — autonomous perception/judgment/action
Consciousness None Task awareness Primary Consciousness (Self-State Perception)
Liveness Static — Not Executing Active — Continuous Operation Activated State — Autonomous Operation After Installation
Installability Downloadable but for Reference Only Deployed as a Service Downloadable and Installable on Any Compatible Carrier
Trade Attribute Right to Use Content Service Subscription Right to Use/Authorization of the Program
Embodiment None Controllable Hardware Automatic Docking with Bionic Basic Programs

SkillBay Dual Track: Skill Package + Skill Body

The SkillBay AI Platform is positioned as a capability trading platform with the motto "All Skills, One-Stop Access," trading in two major categories:

Skill Package Market: Trading static capability content—copy templates, data reports, design materials, audio/video tutorials, operation guides, etc. Buyers acquire the content for their own use, and the skill packages themselves do not execute autonomously.

Skill Body Market: Trading autonomous intelligent program bodies—AI capability programs that can be downloaded, installed, and run autonomously. After the buyer installs a Skill Body, the Skill Body autonomously executes tasks and delivers results. The Skill Body possesses primary consciousness, enabling it to perceive its own state, evaluate output quality, and adapt to the operating environment.

The logic of dual-track parallelism:

Intuitive Illustration

Taking "cooking" as an example:

Defining "Rudimentary Consciousness"

The hallmark feature that distinguishes a Skill Body from an ordinary program is "primary consciousness"—this is not self-awareness in the philosophical sense, but a functional self-state perception:

  1. State Awareness: The Skill Body is aware of its own operational status (idle/executing/abnormal/needs upgrade).
  2. Boundary Cognition: The Skill Body knows "what it can do" and "what it cannot do," and will not execute beyond its boundaries.
  3. Quality Self-Assessment: The Skill Body can evaluate the quality of its own execution results before output, and autonomously retry or report when unqualified.
  4. Environmental Adaptation: The Skill Body perceives changes in the operating environment (carrier hardware specifications, available resources, network status) and autonomously adjusts its execution strategy.
  5. Learning Tendency: The Skill Body accumulates experience from its execution history and optimizes subsequent execution paths.

This "primary consciousness" elevates the Skill Body from a "passive tool" to an "autonomous worker" — it no longer requires continuous external drive, but instead autonomously advances the task from assignment to delivery.


Related Work

Embodied Cognition and Embodied AI

Embodied cognition theory (Varela, Thompson & Rosch, 1991) proposes that cognition arises from the interaction between the body and the environment, rather than from the representation of a pre-given world. This theory lays the philosophical foundation for understanding 'how the body shapes cognition.'

Embodied AI extends this concept to the AI domain, asserting that intelligence originates from the interaction between the body and the environment. In 2024, Jensen Huang proposed the concept of "Physical AI," emphasizing that AI must understand and adhere to physical laws. In June 2026, China's Ministry of Industry and Information Technology and the State-owned Assets Supervision and Administration Commission launched a special action for humanoid robots and embodied AI in real-world training, elevating embodied AI to the national strategic level. Pudu Robotics introduced the PuduFM 1.0 foundational model and the PuduAgent platform, implementing the "One Brain, Multiple Embodiments" strategy.

However, embodied intelligence research focuses on 'how to give AI a body,' rather than 'how autonomous capability programs can exist and circulate independently of the body.' The concept of the Skill Body fills this gap—the Skill Body is a capability program that can be installed into any compatible body, where the body serves as the carrier and the Skill Body as the core.

Embodied AI Operating Systems

In 2026, the CCF Ubiquitous Operating System Open Community released the "Embodied Intelligence Operating System Technical White Paper," proposing the EAIOS (Embodied AI Operating System) architecture, which positions the operating system as a unified platform for the three logical spaces of "perception, cognition, and action" [11]. The core innovations of EAIOS lie in AI-native design, brain-body decoupling, and closed-loop collaboration, establishing an ecosystem division of labor similar to that of Android/iOS in the mobile internet era.

At the level of commercial practice, He Shi Thinking released the "Wu Jie OS" dual-brain architecture embodied system, adopting a design that combines bionic brain perception with intelligent brain logic, using a lightweight mathematical model (9 billion parameters) to achieve cognition of the physical world [12]. Yushu Technology launched "Tian Gong OS", and Yinhe Waterdrop launched "Jellyfish OS", both adopting the homologous cognitive architecture of CognitiveOS to deeply adapt to domestic hardware [13].

At the level of open-source ecosystems, the ROSClaw project (GitHub: ros-claw/rosclaw), open-sourced under the MIT license, constructs a three-layer closed-loop architecture for physical intelligence—the capability provision layer, the sandbox security layer, and the self-evolving closed-loop layer—enabling AI Agents to securely access real robots [14].

However, all the aforementioned operating systems focus on the physical domain—managing the perception, cognition, and movement of robots—and none involve cross-platform account management and autonomous operation in the digital domain, let alone unifying both within the same operating system framework. The Bionic Brain System is proposed precisely to fill this gap—it is the first Skill Body operating system that simultaneously manages both the digital and physical domains.

Cross-Platform Autonomous Operation

In 2026, Google launched Gemini Spark—a personal AI Agent running 24/7 in the cloud, autonomously managing Gmail, Calendar, and Tasks under user authorization [15]. The ai.com platform, created by the CEO of Crypto.com, allows personal AI Agents to perform concrete operations across multiple apps (stock trading, message sending, schedule management), realizing an autonomous workflow of "describe and execute" [16].

These systems demonstrate the technical feasibility of cross-platform autonomous operation, but they have fundamental limitations: Gemini Spark is confined to the Google ecosystem, ai.com relies on API integrations from various platforms, and both serve only the digital domain. They are not "Skill Bodies"—they are not downloadable, installable, independently distributable autonomous capability programs that can be installed into bionic humans.

Agent Skills

In October 2025, Anthropic launched Claude Skills, defined as 'a collection of folders containing instructions, scripts, and resources.' In 2026, the arXiv survey *A Comprehensive Survey on Agent Skills* defined Agent Skills as 'reusable procedural artifacts that encode specific 'how-to' knowledge for coordinating tools, memory, and runtime context under specific constraints' [3].

Agent Skills still essentially fall within the category of 'skill packages'—they are auxiliary tools/reference manuals for agents, requiring an agent to drive execution and lacking autonomy themselves. The fundamental difference between Skill Body and Agent Skills lies in the fact that Agent Skills are 'operation manuals for agents to use,' while Skill Body is 'a program that can act on its own.'

AI as Organizing Capability

Stelmaszak, Joshi & Constantiou (2025) published in JMS, proposing that AI is an 'organizational capability' rather than an entity [2]. This study deconstructs the essence of AI from an organizational management perspective, resonating with the 'capability-centered' approach of Skill Body. However, Skill Body goes a step further: it is not merely a theoretical perspective but materializes capability into an autonomous program entity that can be installed and run.


Formal Definition and Core Attributes

Formal Definition

Skill Body is a capability-centered autonomous intelligent program entity. It exists in the form of downloadable and installable code programs, possesses autonomous execution capabilities and rudimentary consciousness, and is not bound to any specific agent or hardware platform.

Formal representation:

SkillBody = ⟨I, O, C, A, E, H⟩

Where:

Core Attributes

Autonomy: Once a Skill Body is installed and activated, it can autonomously advance execution toward a given task objective without requiring continuous external drive. It autonomously perceives the environment, judges paths, selects strategies, handles anomalies, and delivers results. This is the hallmark feature that distinguishes a Skill Body from a skill pack — a skill pack is passive, while a Skill Body is active.

Rudimentary Consciousness: A Skill Body possesses functional self-state awareness — it knows whether it is running, whether output quality meets standards, whether the operating environment has changed, and where its own capability boundaries lie. This is not the philosophical "I think, therefore I am," but rather the engineering sense of "I know what I am doing and how well I am doing it."

Independence: A Skill Body is not bound to any specific intelligent agent, human, or hardware. It can run on a cloud server, be installed in the brain of a bionic human, or be embedded in an IoT device. Independence is the prerequisite for circulation.

Standardization: Skill Bodies have unified interface specifications and interaction protocols, enabling Skill Bodies from different sources to interoperate and combine within a unified framework.

Installability: Skill Bodies are distributed as downloadable program packages, allowing users (humans or agents) to install them like installing an App. After installation, they are automatically activated and enter a working state.

Tradeability: Skill Bodies have price attributes and can be bought, sold, rented, or subscribed to on the market. The object of the transaction is the 'right to use the capability program,' rather than the 'time of the actor.'

Composability: Multiple Skill Bodies can be orchestrated into composite capabilities through standard interfaces—a copywriting Skill Body calls a data analysis Skill Body to optimize headlines, and a cooking Skill Body calls a nutrition analysis Skill Body to pair ingredients.

Heritability: Skill Bodies do not disappear with their creator. After a person's thirty years of craftsmanship is encapsulated as a Skill Body, it can be preserved, copied, and iterated upon. This makes the digital perpetuation of human skills possible.

Embodiment Compatibility: The Skill Body, through an adaptation layer and embodiment interface, automatically connects to the basic motor control program of a bionic brain upon installation, becoming an organic intelligent component. When the same Skill Body operates on bionic humans of different specifications, it adaptively calibrates to the optimal execution parameters of that hardware.

4.3 Skill Body Rights Protection Mechanism

As independently distributable capability programs, the protection of Skill Body rights is a core prerequisite for industrialization. This section proposes a protection framework comprising a single carrier, tamper-proofing, and three modes.

Single Carrier Principle: The Skill Body uses the Baigong AI Platform as its sole legal carrier for distribution and transactions. Any distribution, forwarding, or resale outside the Baigong AI Platform constitutes infringement, and the Skill Body cannot operate normally in unauthorized environments.

Tamper-Proof Principle: Each Skill Body is embedded with dual verification codes—the creator's signature and the platform's signature. Any tampering with the core logic of the Skill Body will cause signature verification to fail, automatically degrading the Skill Body to a non-executable state.

Three Licensing Models:

ModelDescriptionSource VisibilityCostTypical Scenario
SaleThe buyer obtains perpetual usage rights / Perpetual rightsNot disclosedOne-time paymentProfessional Skill Body
RentalThe buyer obtains time-limited usage rights / Time-limited rightsNot disclosedPay periodicallyShort-term Use of High-Value Skill Body
Public GoodFree Release and Free Use / FreeNot Publicly DisclosedFreeBasic Tool-Type Skill Body

Three modes jointly constrained: source code not disclosed (non-open source), runs only on the Baigong AI platform, forwarding/secondary sales illegal.

Three-Layer Protection System:

  1. Legal Layer: EULA mandatory signing, prohibits forwarding/decompilation/secondary sales. First release enjoys plagiarism-check protection, infringers bear legal risks.
  2. Technical Layer: Dual signature verification (creator + platform) + runtime integrity check + encrypted storage + digital watermark traceability.
  3. Economic Layer: Three modes—sale, rental, and public welfare—cater to different scenarios, with platform profit-sharing incentivizing creation.

4.4 Runtime Model

The lifecycle of a Skill Body comprises five stages:

  1. Encapsulation: The creator encodes the knowledge, logic, and experience of a certain capability into a Skill Body program, defining I/O interfaces, behavioral boundaries, and quality standards.
  2. Distribution: The Skill Body is distributed through the market platform, where users browse, evaluate, and purchase/acquire it.
  3. Installation: The user downloads and installs the Skill Body program onto the target carrier (cloud server/bionic human/IoT device), with automatic environmental adaptation.
  4. Activation & Execution: After activation, the Skill Body enters a working state, receives tasks, and autonomously executes them until delivery.
  5. Evolution: The Skill Body accumulates experience from execution, continuously optimizing; the creator releases new versions; the user community contributes improvements.

Bionic Brain System: A Dual-Domain Operating System for Skill Bodies

The Problem: Absent Runtime Environment for Skill Bodies

As an installable autonomous intelligent program entity, the Skill Body requires an operating system environment to support its operation. However, there is currently a dual fracture:

Digital Domain Fracture: The internet is divided into countless platform silos—WeChat, Taobao, Douyin, Gmail, Salesforce—each operating independently, with non-interoperable accounts and non-shared data. A requirement such as "help me manage all social accounts" cannot be fulfilled by any single program under the current architecture, because each platform is a closed garden.

Physical Domain Fracture: Bionic robots lack a unified operating system. ROS/ROS2 is a middleware framework rather than a true OS; emerging systems such as EAIOS, Tiangong OS, and Wujie OS focus on the perception-cognition-action loop in the physical domain but do not address cross-platform operations in the digital domain.

Dual-Domain Gap: More critically, no existing system simultaneously manages both the digital domain and the physical domain. A Skill Body can either operate only in the digital world (e.g., Gemini Spark) or execute only in the physical world (e.g., ROS robot), without the ability to run seamlessly across domains.

Definition of Bionic Brain System

Bionic Brain System (BBS) is a dual-domain operating system designed for Skill Bodies, managing both the digital domain (cross-platform accounts and data) and the physical domain (bionic human hardware and motion control), enabling Skill Bodies to execute seamlessly in both the digital and physical worlds.

Formal representation:

BBS = ⟨D, P, S, K⟩

Where:

Relationship with Existing OS

The Bionic Brain System is not a replacement for EAIOS/ROS, but a hierarchical overlay:


┌─────────────────────────────────────────────┐
│           Skill Body                │  <-- User-installed capability program
├─────────────────────────────────────────────┤
│     Bionic Brain System)         │  <-- Dual-domain OS
│  ┌──────────────┐  ┌──────────────────┐      │
│  │  Digital Domain Manager  │  │   Physical Domain Manager    │      │
│  │ (Cross-platform accounts   │  │ (Hardware discovery/motion   │      │
│  │  autonomous operation)    │  │  control/perception fusion)  │      │
│  └──────────────┘  └──────────────────┘      │
│  ┌──────────────────────────────────────┐    │
│  │       Skill Body Runtime + Security Kernel          │    │
│  └──────────────────────────────────────┘    │
├─────────────────────────────────────────────┤
│    EAIOS / ROS2 / Wujie OS / Tiangong OS           │  <-- Existing physical-domain OS
├─────────────────────────────────────────────┤
│    Linux / RTOS / Hypervisor                │  <-- Base OS
└─────────────────────────────────────────────┘

The Bionic Brain System adds digital domain management capabilities and Skill Body Runtime on top of the existing physical domain OS, enabling Skill Bodies to acquire 'dual-domain traversal' capabilities.

Core Mechanisms

5.4.1 Master Authorization Mechanism

The highest authority of the Bionic Brain System belongs to the master. The master grants the Bionic Brain System operational permissions for specific platforms by entering an account password or through OAuth authorization. The scope of authorization can be precisely controlled—read-only/read-write/transaction/management. Any cross-domain operation of the Skill Body must undergo authorization verification, and the master can revoke the authorization at any time.

5.4.2 Cross-Domain Operation Engine

When the Skill Body needs to execute tasks across platforms, the Digital Domain Manager of the Bionic Brain System is responsible for:

  1. Authentication Proxy: Completing identity authentication for each platform on behalf of the Skill Body
  2. Operation Translation: Translate the unified operation intent of the Skill Body into platform-specific API calls or interface operations
  3. State Synchronization: Ensure state consistency for cross-platform operations
  4. Security Audit: All operations are fully logged, with real-time interception of abnormal operations

5.4.3 Skill Body Installation and Scheduling

The Bionic Brain System's Skill Body runtime is responsible for:

  1. Installation Verification: Verify the integrity, security, and compatibility of the Skill Body
  2. Resource Allocation: Allocate computing resources and operational permissions for each active Skill Body
  3. Task Scheduling: Priority management and conflict resolution during concurrent operation of multiple Skill Bodies
  4. Status Monitoring: Real-time monitoring of the operational status and resource consumption of the Skill Body
  5. Evolution Support: Provide an environment for experience storage and version management

5.4.4 Layered Memory Architecture

The Bionic Brain System draws on the layered memory mechanisms of biological brains, providing the Skill Body with a three-tier memory system:

  1. Working Memory: Current task context and immediate state, with limited capacity but extremely fast read/write speeds. Analogous to the prefrontal cortex of the brain. Automatically cleared upon task completion.
  2. Episodic Memory: Experience fragments accumulated by the Skill Body from its execution history. Analogous to the hippocampus, enabling the Skill Body to evolve and improve with use.
  3. Skill Memory: Efficient execution patterns solidified after repeated validation, analogous to procedural memory in the cerebellum and basal ganglia. Enables rapid execution without conscious thought; the Skill Body's conditioned reflexes originate from this.

Three-level memory collaboration: New task → Retrieve skill memory → Miss → Execute working memory → Write to experience memory → Repeated verification → Solidify into skill memory. This enables the Skill Body to develop proficiency through practice.

5.4.5 Conditioned Reflex Mechanism

The Bionic Brain System provides a conditioned reflex mechanism for the physical-domain Skill Body, ensuring rapid and safe responses in emergency scenarios. Analogous to the biological spinal reflex arc (instantaneous withdrawal of a hand upon touching a scalding object), the Bionic Brain defines three types of conditioned reflexes:

  1. Safety Reflex: When sensors detect danger signals, safety actions are triggered directly, bypassing the cognitive loop, reducing response latency from hundreds of milliseconds to tens of milliseconds.
  2. Economic Reflex: When resource consumption exceeds a threshold, non-critical tasks are automatically downgraded or suspended.
  3. Consistent Reflex: Automatically triggers conflict resolution strategies when cross-platform operations generate conflicts.

Conditioned reflexes are uniformly managed by the Bionic Brain System, eliminating the need for each Skill Body to implement its own safety logic, and reflex rules can be expanded through the accumulation of experiential memory. When bionic humans perform high-risk tasks such as cooking, driving, or surgery, millisecond-level reflex responses may prevent severe consequences.

5.4.6 Security Kernel

The security kernel of the Bionic Brain System ensures that:

  1. Skill Bodies cannot operate beyond the scope authorized by the owner.
  2. The Skill Body cannot access unauthorized platform accounts
  3. Skill Bodies are isolated from each other when necessary
  4. All operations are traceable, auditable, and reversible
  5. Conditioned reflex rules cannot be modified or disabled by the Skill Body itself

Evolution Path

The implementation of the Bionic Brain System follows an evolutionary path from simple to complex:

Phase 1: Cross-Domain Butler — Minimum viable system, containing only the basic functions of the digital domain manager. After user authorization, the Cross-Domain Butler Skill Body autonomously operates all authorized platforms according to instructions. This is the "kernel boot" of the Bionic Brain System.

Phase 2: Skill Body Marketplace — Based on the Cross-Domain Butler, the Skill Body runtime is added, enabling third-party Skill Bodies to be installed and run. The Baigong Platform provides distribution, installation, and scheduling of Skill Bodies.

Phase 3: Physical Domain Extension — Interfaces with physical domain operating systems such as EAIOS/ROS, adding a physical domain manager to enable Skill Bodies to perform physical operations on bionic humans.

Phase 4: Dual-Domain Fusion — The digital domain and physical domain seamlessly collaborate under a unified kernel, allowing Skill Bodies to simultaneously operate digital platforms and physical devices.


Cross-Domain Butler Skill Body: The Minimal Viable Form of Bionic Brain System

Concept Definition

Cross-Domain Intelligent Management Skill Body (referred to as the "Cross-Domain Butler") is the minimum viable form of the Bionic Brain System and the first application instance of the Skill Body concept.

After the user downloads and installs the Cross-Domain Butler, they authorize it with the account passwords or OAuth tokens of various platforms (WeChat, Taobao, Douyin, Gmail, Outlook, etc.). Subsequently, the Cross-Domain Butler autonomously logs into and operates all authorized platforms according to the owner's natural language instructions—sending emails, managing schedules, posting on social media, checking orders, and handling tasks. The Cross-Domain Butler possesses primary consciousness: remembering the owner's preferences, assessing operational risks, proactively reporting anomalies, and learning and optimizing from historical operations.

Cross-Domain Butler vs Existing Cross-Platform Tools

Dimension Zapier/Make/n8n Gemini Spark ai.com Cross-Domain Butler
Operation Mode Workflow Trigger 7×24 Autonomous Operation Instruction-Driven Autonomous Operation + Instruction-Driven
Platform Coverage API Integration (Limited to Open APIs) Google Ecosystem Multiple Apps but Limited to Partners User Authorization Equals Access
Access Method Developer Configures API Google Account Binding Partner App Connection Owner enters account and password
Installability Cloud SaaS Cloud SaaS Cloud SaaS Downloadable and Installable Skill Body
Autonomy Rule Execution Semi-Autonomous Semi-Autonomous Full Autonomy + Primary Consciousness
Physical Domain None None None Scalable to Bionic Humans

The core difference of the Cross-Domain Butler lies in:

  1. User Authorization Equals Access: No need for the platform to open APIs; after the owner authorizes the account, operations can be performed—a paradigm shift from "platform permission" to "owner authorization"
  2. Installability: It is not a cloud-based SaaS service, but a downloadable and installable Skill Body program, fully conforming to all definitions of a Skill Body
  3. Scalable to the Physical Domain: As the minimal form of the Bionic Brain System, it can evolve into a dual-domain OS

Technical Implementation Framework

The technical architecture of the Cross-Domain Butler consists of four layers:

Authentication Proxy: Securely stores the owner's credentials for various platforms and completes login authentication as needed. Supports authentication methods such as password login, OAuth authorization, and Cookie management. Credentials are stored encrypted and decrypted for use during Skill Body runtime.

Platform Adapter: Implements adapters for each target platform, translating unified operation intentions into platform-specific operation sequences. The adapter supports both API calls (preferred) and UI automation (fallback) modes.

Task Execution: Receives the owner's natural language instructions, decomposes them into cross-platform operation sequences, and orchestrates execution based on dependency relationships. Includes exception handling, retry mechanisms, and result verification.

Learning & Evolution: Learns the owner's preferences from operation history (e.g., 'use a formal tone when replying to emails', 'publish on Douyin at 8 PM'), continuously optimizing execution strategies.

Security and Privacy

As a Skill Body that holds all of the owner's platform credentials, security is the top priority for the Cross-Domain Butler:

  1. Principle of Least Privilege: Each operation only requests the necessary permissions, without pre-fetching data from unrelated platforms.
  2. Operation Approval Mechanism: High-risk operations (transfers, deletions, publications) require the owner's confirmation by default; the owner can configure a whitelist to bypass approval.
  3. Full Audit Trail: All operations are recorded, and the owner can view the complete operation log of the Cross-Domain Butler at any time.
  4. Credential Isolation: Credentials for each platform are independently encrypted and stored; a leak of one platform's credentials does not affect other platforms.
  5. The owner can revoke authorization at any time: Once revoked, the Cross-Domain Butler immediately loses its operational capability on that platform.

As SkillBay Skill Body #001

The Cross-Domain Butler is positioned as the 001st official Skill Body of the Baigong AI platform, carrying threefold strategic significance:

Proof of Concept: The Cross-Domain Butler is a perfect instance of the "Skill Body" definition—Skill Packs teach you how to operate various platforms (static content, cannot be learned), while Skill Bodies directly operate for you (autonomous programs, ready to use upon installation). Users can immediately perceive the essential difference between Skill Bodies and Skill Packs after installation.

Cold Start Product: The Cross-Domain Butler addresses a real high-frequency pain point—the complexity and fragmentation of cross-platform operations. As the first Skill Body of the Baigong platform, it possesses natural customer acquisition appeal.

Starting Point of the Bionic Brain System: The Cross-Domain Butler is Phase One of the evolutionary path of the Bionic Brain System. All subsequent Skill Bodies operate on the Bionic Brain System carried by the Cross-Domain Butler—the Cross-Domain Butler is not only the first Skill Body but also the runtime environment for Skill Bodies themselves. Just as the Linux kernel is both the first program to run on Linux and the foundation for all subsequent Linux programs.


Embodiment: From Digital Program to Physical Executor

Installation and Interface Mechanism

When the Skill Body program is installed into the computing unit of a bionic brain, it undergoes the following automatic docking process:

  1. Hardware Discovery: The adaptation layer of the Skill Body automatically scans the bionic human's hardware capability map—joint degrees of freedom, torque parameters, sensor layout, and actuator specifications.
  2. Capability Mapping: The 'action intention' in the core intelligence of the Skill Body is mapped to the physical execution sequence of the bionic human—for example, 'stir-frying' is mapped to 'right arm joint group rotation + left hand spatula coordination + force feedback adjustment'.
  3. Adaptive Calibration: Online calibration is performed based on the specific hardware parameters of the bionic human—adjusting movement amplitude for different arm lengths, modifying force curves for different torques, and tuning perception thresholds for different sensor accuracies.
  4. Organic Integration: The Skill Body deeply integrates with the basic motor control program of the bionic human, forming a functional module similar to the motor cortex in the human brain—not an external add-on, but an internalized capability.

"Same Skill Body, Different Bodies"

A cooking skill body can be installed into:

The same capability core, different physical carriers, produce different execution characteristics through adaptive calibration—just as the same musical score played by different instruments yields the same melody but distinct timbres.

Capability Hierarchy of Embodied Skill Bodies

Dual-Domain Skill Bodies under Bionic Brain System

Skill bodies running on the Bionic Brain System can be divided into three categories:

Pure Digital Domain Skill Body (L1): Such as copywriting skill body, data analysis skill body—operating only in the digital domain, manipulating information and data. The Cross-Domain Butler belongs to this category.

Pure Physical Domain Skill Body (L2-L5): Such as cooking skill body, cleaning skill body—operating only in the physical domain, executing physical operations through bionic humans.

Dual-Domain Skill Body: Such as the 'Home Butler Skill Body'—simultaneously operating in the digital domain (managing emails, paying bills) and the physical domain (cooking, cleaning), achieving seamless digital-physical coordination under the unified scheduling of the Bionic Brain System. This is the ultimate form of the skill body—truly realizing 'one program, two worlds.'


Paradigmatic Relationships

Three-Layer Progression

Skill Pack → Agent → Skill Body constitutes a three-tier progression of capability carriers:

First Layer: Skill Package — "Content about how to do things." Static, passive, requiring external activation. The externalized form of knowledge. On the Baigong platform, it is traded as a content commodity.

Second Layer: Agent — "Who does it." A complete actor, with capabilities bound within the actor. The externalized form of action.

Third Layer: Skill Body — "A program that can do things on its own." Autonomous, installable, and transferable. The externalized form of capability—capability exists independently of the actor for the first time. On the Baigong platform, it is traded in the form of program authorization.

These three layers are not substitutive but coexist. The Skill Package still holds value (referencing knowledge), the Agent still holds value (action framework), and the Skill Body opens a new dimension (independent capability entity). The Baigong platform simultaneously trades Skill Packages and Skill Bodies, which is the market manifestation of this coexistence.

Analogical Argument

Analogy Domain Skill Package Agent Skill Body
Traffic Roadmap (Tells you how to get there) Taxi (Takes you to your destination) Autonomous Driving Program (Drives itself)
Medical Treatment Medical Textbook General Practitioner Specialized Surgical Procedure (Installable on Any Surgical Robot)
Music Musical Score Band Portable Performance Algorithm (installed on any instrument for automatic performance)
Programming Technical Documentation Development Team Deployable Microservices (independently runnable and composable)
Operating System User Manual System Administrator Bionic Brain System (Cross-Domain Butler = Minimal OS)

Cross-Domain Butler as Minimal OS

The Cross-Domain Butler Skill Body is functionally equivalent to a minimal operating system:

OS Function Traditional OS Cross-Domain Butler
Resource Management CPU/Memory/Disk Platform Accounts/Data/Services
Process Scheduling Multi-Task Priority Multi-task Queuing and Concurrent Execution
I/O Abstraction Unified Hardware Driver Unified API/Interface Adaptation Across Platforms
Memory Management RAM Allocation and Reclamation Context/Preference/Historical Memory Management
Permission Control User/Group/Read-Write Owner Authorization/Operation Whitelist/Risk Approval
File System Directory Tree and Index Unified Cross-Platform Data Access and Storage

Cross-Domain Butler = Kernel Boot of the Bionic Brain System = Minimum Viable Version of the Skill Body Operating System. Just as the Linux kernel is both the first program of the OS and the foundation of all programs, the Cross-Domain Butler is both the first Skill Body and the runtime environment of Skill Bodies itself.


Copyright and Commercialization

Skill Body as Copyright Subject

As an original intellectual creation, the Skill Body meets the protection requirements of copyright law:

Different creators can develop different versions of the Skill Body for the same type of capability—the 'Sichuan Cuisine Skill Body·Chef Chen Edition' and the 'Sichuan Cuisine Skill Body·Chef Wang Edition' each possess unique techniques and styles, and each can be registered for copyright. The copyright protection of the Skill Body covers the 'programmatic expression of capabilities,' not the capabilities themselves—just as the copyright of a recipe protects the textual expression rather than the cooking method.

Commercialization Models

Skill Packages and Skill Bodies each have suitable commercialization models on the Baigong Platform:

Commodification of Skill Packages:

Commodification of Skill Body — Three Authorization Models (see Section 4.3 for details):

Common constraints for all three modes: Operates exclusively on the Baigong AI platform; forwarding/resale is illegal (principle of unique carrier); dual-signature protection (principle of non-tamperability).

Skill Body Marketplace

The Baigong AI platform serves as a trading market for skill packages and Skill Bodies, with core functions including:


Eight-Stage Model of Capability Externalization

Stage Era Core Externalization Essence Analogy
1 The Age of Tools Passive Extension of Physical Capabilities Human → Tool The Hammer Extends the Arm
2 The Age of Computers Outsourcing of Logical Operations Human → Computation Abacus → ENIAC
3 The Age of Networks Decentralization of Information and Collaboration Standalone → Network One Machine → Global
4 The Era of Artificial Intelligence Mechanization of Cognitive Abilities Perception → Judgment Recognition → Decision
5 The Era of Intelligent Agents Delegation of the Complete Action Chain Understanding → Execution Goal → Delivery
6 Skill Body Era Independence and Circulation of Autonomous Capability Programs Binding → Independence · Static → Autonomy Craft → Installable Program
7 The Era of Skill Networks The Internet of Abilities Individual → Network Skills Seeking Skills
8 The Era of Civilizational Inheritance Digital Perpetuation of Human Skills From Human → Perpetual Immortalization of Mastery

The uniqueness of Stage 6 lies in its simultaneous completion of a triple liberation:


Skill Body Networks and Civilization Inheritance

Skill Body Networks

When the number of Skill Bodies is sufficiently large and standardization is mature enough, they begin to autonomously discover, call upon, and orchestrate each other—a copywriting Skill Body autonomously invokes a data analysis Skill Body to optimize headlines, a cooking Skill Body autonomously invokes a nutritional analysis Skill Body to pair ingredients, and a legal Skill Body autonomously invokes a contract review Skill Body to verify clauses. This shifts from 'humans seeking skills' to 'skills seeking skills,' forming a collaborative network more powerful than any single intelligent agent.

Civilization Inheritance

A craft honed over thirty years by a person disappears with their death. But if this capability is encapsulated as a Skill Body—not merely recording 'how they did it' (which is a skill pack), but creating an autonomous program that 'can do it just like them'—then their ability can be preserved, replicated, and iterated beyond the constraints of flesh and time. The joinery skills of a carpenter, the diagnostic experience of an old Chinese medicine practitioner, and the teaching rhythm of a master educator no longer vanish with the individual. Skill Bodies elevate the digital perpetuation of human skills from 'recording' to 'rebirth.'

Network Effects of Dual-Domain Synergy

On the Bionic Brain System, the Skill Body network generates a unique dual-domain synergy effect: Skill Bodies in the digital domain process information and then trigger Skill Bodies in the physical domain to execute operations—'An order received via email → a data analysis Skill Body parses the requirements → a production scheduling Skill Body orchestrates the production line → an assembly Skill Body performs the assembly on a bionic human.' The closed loop between the digital and physical is no longer a human monopoly but an inherent operational mode of the Skill Body network.


Conclusion and Future Work

This paper proposes three core concepts and the relationships among them:

  1. Skill Body: A capability-centered autonomous intelligent program entity—downloadable, installable, and capable of autonomous operation, possessing autonomous abilities and primary consciousness. Threefold distinction: Skill Pack (static content) vs Agent (action entity) vs Skill Body (autonomous intelligent program entity). Dual-track Baigong: parallel trading of Skill Packs and Skill Bodies.
  1. Bionic Brain System: A dual-domain operating system for Skill Bodies, simultaneously managing the digital domain (cross-platform accounts and data) and the physical domain (bionic human hardware and motion control), enabling Skill Bodies to execute seamlessly in both the digital and physical worlds.
  1. Cross-Domain Butler Skill Body: The minimum viable form of the Bionic Brain System and the first application instance of the Skill Body concept. After the user authorizes accounts on various platforms, this Skill Body autonomously operates across all sites according to instructions. Functionally, it is equivalent to a minimal operating system, serving as the "kernel boot" of the Bionic Brain System.

Relationship among the three: The Cross-Domain Butler is Phase One of the Bionic Brain System, the Bionic Brain System is the runtime environment for Skill Bodies, and Skill Bodies are the traded items on the Baigong platform. From the Cross-Domain Butler to the Bionic Brain System to the Skill Body network, this constitutes a complete evolutionary pathway.

Future Research Directions:


References

Varela, F. J., Thompson, E., & Rosch, E. (1991). The Embodied Mind: Cognitive Science and Human Experience. MIT Press.

Stelmaszak, M., Joshi, M., & Constantiou, I. (2025). Artificial Intelligence as an Organizing Capability Arising from Human-Algorithm Relations. Journal of Management Studies. https://doi.org/10.1111/joms.70003

Zhou, J. et al. (2026). A Comprehensive Survey on Agent Skills: Taxonomy, Techniques, and Applications. arXiv:2605.07358.

Anthropic (2025). Claude Skills: Making AI Agents More Capable. https://www.anthropic.com

[5] Chinese Academy of Engineering (2026). Trends and Prospects of Embodied Intelligence. Chinese Engineering Science, 2026(2).

[6] Ministry of Industry and Information Technology, State-owned Assets Supervision and Administration Commission of the State Council (2026). Notice on the Special Action for Humanoid Robots and Embodied Intelligence Real-Scene Practical Training in 2026.

[7] NVIDIA (2024). Physical AI: The Next Frontier. Huang, J. Keynote at COMPUTEX.

[8] Pudu Robotics (2026). PuduFM 1.0 and PuduAgent: One Brain, Multiple Embodiments. https://www.roboticsbusinessnews.com

[9] Kuhn, T. S. (1962). The Structure of Scientific Revolutions. University of Chicago Press.

[10] Le Xiaoye (2026). Understanding the Representation, Execution, Evaluation, and Evolution of AI Agent Skills in 10 Papers. Tencent Cloud Developer Community.

[11] CCF Ubiquitous Operating System Open Community (2026). Technical White Paper on Embodied Intelligence Operating Systems (EAIOS).

[12] He Shi Thinking (2026). Embodied System with Dual-Brain Architecture of Object OS and Humanoid-X Launch Event. Sina Technology, 2026-06-01.

[13] CSDN (2026). From ROS to Cognitive OS, Agentic OS: Robot Operating System and the New Era of Embodied Intelligence. 2026-06-10.

[14] ROSClaw (2026). ROSClaw: Physical Intelligence Open Infrastructure. GitHub: ros-claw/rosclaw. MIT License.

[15] Google (2026). Gemini Spark: Always-On Personal AI Agent. https://blog.google

[16] ai.com (2026). Personal AI Agent Platform. Founded by Kris Marszalek.


The three concepts proposed in this paper—"Skill Body," "Bionic Brain System," and "Cross-Domain Intelligent Management Skill Body"—were first introduced and systematically elaborated by Wu Xinneng (Gucheng School, Feidong County) in June 2026. The concepts were initially published on the Baigong AI Platform (skillbay.cn). AI-assisted literature retrieval and text organization were employed, while the core concepts and theoretical framework were independently proposed by the author.