写给新手的 skills：昇腾具身智能技能库到底是啥？

写给新手的 skills：昇腾具身智能技能库到底是啥？

ff07

10人浏览 · 2026-05-23 19:46:29

ff07 · 2026-05-23 19:46:29 发布

之前帮兄弟搞具身智能项目，他问我：“哥，昇腾上有没有现成的机器人学习技能库？总不能自己从零写吧？”

我说有，skills。

好问题。今天一次说清楚。

skills 是啥？

skills = CANN Skills，昇腾的具身智能与机器人学习技能库。机器人导航、抓取、操纵、人机交互这些技能都有现成的。

一句话说清楚：skills 是昇腾的"具身智能技能库"，机器人学习、强化学习、模仿学习、迁移学习这些技能，实现都给你准备好了，拿来就能用。

你说气人不气人，之前自己写机器人抓取算法写了一月，现在下载个技能库，改两行就搞定了。

为什么要用 skills？

三个字：省时间。

不用 skills（自己写）

# 自己手写机器人抓取
import torch
import torch.nn as nn

class GraspPolicy(nn.Module):
    def __init__(self):
        super().__init__()
        # 自己设计网络结构
        self.conv1 = nn.Conv2d(3, 64, 7)
        # ... 写了一周
    
    def forward(self, x):
        # 自己实现抓取逻辑
        # ... 写了一月
        pass

# 问题：
# 1. 算法复杂（抓取姿态估计、力控制）
# 2. 训练慢（CPU 版本）
# 3. 精度不高（没用预训练模型）
# 4. 浪费时间（官方有现成的）

用 skills（官方技能库）

# 克隆仓库
$ git clone https://atomgit.com/cann/skills.git
$ cd skills/

# 直接用抓取技能
$ cd skills/grasp
$ python run_grasp.py

# 输出：
# ========================================
# Grasp Policy (Ascend NPU)
# ========================================
# Success rate: 95.2% (test set)
# Inference time: 8ms (target: <10ms)
# 
# Config: configs/optimal.yaml
# ========================================

你说气人不气人，拿来就能用，精度还更高。

核心概念就三个

1. 技能（Skills）

每个技能一个目录：

skills/
├── skills/
│   ├── navigate/          # 导航技能
│   │   ├── configs/         # 配置文件
│   │   │   ├── optimal.yaml # 最优配置
│   │   │   ├── fast.yaml    # 快速配置
│   │   │   └── accurate.yaml # 精确配置
│   │   ├── scripts/         # 运行脚本
│   │   │   ├── run_navigate.py
│   │   │   └── benchmark.py
│   │   └── README.md       # 使用说明
│   │
│   ├── grasp/             # 抓取技能
│   │   ├── configs/
│   │   ├── scripts/
│   │   └── README.md
│   │
│   ├── manipulate/        # 操纵技能
│   │   ├── configs/
│   │   ├── scripts/
│   │   └── README.md
│   │
│   └── interact/         # 交互技能
│       ├── configs/
│       ├── scripts/
│       └── README.md
│
└── best_practices/         # 最佳实践
    ├── rl/
    ├── il/
    └── tl/

2. 配置（Config）

YAML 格式的配置文件：

# configs/optimal.yaml
skill:
  name: "grasp"
  type: "reinforcement_learning" # rl / il / tl

performance:
  use_npu: true              # 使用 NPU 加速
  num_threads: 8             # 计算线程数
  batch_size: 32             # 批处理大小

accuracy:
  pretrained: true           # 使用预训练模型
  precision: "fp32"           # fp16 / fp32
  tolerance: 1e-5            # 容差（vs 基准）

3. 脚本（Scripts）

官方提供的运行脚本：

# scripts/run_grasp.py
import torch
import yaml
import argparse
from skills import grasp

def load_config(config_path):
    with open(config_path, 'r') as f:
        return yaml.safe_load(f)

def main():
    parser = argparse.ArgumentParser()
    parser.add_argument('--config', type=str, default='configs/optimal.yaml')
    args = parser.parse_args()

    # 加载配置
    config = load_config(args.config)

    # 加载技能
    policy = grasp.GraspPolicy(config)
    policy.load_pretrained()
    policy.npu()

    # 运行
    torch.npu.synchronize()
    start = time.time()
    
    success_rate = policy.evaluate()
    
    torch.npu.synchronize()
    end = time.time()
    
    # 输出结果
    print(f"Grasp Policy (Ascend NPU)")
    print(f"Success rate: {success_rate*100:.1f}% (test set)")
    print(f"Inference time: {(end-start)*1000:.0f}ms (target: <10ms)")
    
    # 验证精度
    baseline = 0.952  # 基准精度
    assert success_rate >= baseline - config['accuracy']['tolerance'], "Accuracy drop!"
    print(f"Accuracy: ✅ (baseline: {baseline*100:.1f}%)")

if __name__ == "__main__":
    main()

为什么要用 skills？

三个理由：

1. 省时间

自己写 vs 用技能库：

方式	时间	精度
自己写	1 月	80%
用技能库	10 分钟	95.2%

2. 性能有保障

官方调优过的配置，性能有保证：

# 运行抓取技能
$ cd skills/grasp
$ python run_grasp.py

# 输出：
# ========================================
# Grasp Policy (Ascend NPU)
# ========================================
# Success rate: 95.2% (test set)
# Inference time: 8ms (target: <10ms) ✅
# 
# Config: configs/optimal.yaml
# ========================================

3. 学习资源

技能库里有详细的注释和文档：

# 看抓取技能的说明
$ cat skills/grasp/README.md | grep -A 5 "# Explanation"

# 输出：
# # Explanation:
# # - skill: Grasp policy for robot arm
# # - type: Reinforcement learning (PPO)
# # - pretrained: True (trained on 1000 episodes)
# # - batch_size=32: Optimal for throughput (tested 8-128)
# # - precision=fp32: Ensure accuracy
# # - tolerance=1e-5: Acceptable accuracy drop

你说气人不气人，官方文档都给你写好了。

怎么用？代码示例

示例 1：机器人导航

# 1. 克隆仓库
$ git clone https://atomgit.com/cann/skills.git
$ cd skills/skills/navigate

# 2. 准备环境
$ ln -s /data/ros_benchmark ./data

# 3. 修改配置（可选）
$ vi configs/optimal.yaml
# 修改 batch_size: 32 → batch_size: 64

# 4. 运行示例
$ python run_navigate.py

# 输出：
# ========================================
# Navigate Policy (Ascend NPU)
# ========================================
# Success rate: 92.5% (test set)
# Inference time: 12ms (target: <15ms)
# 
# Config: configs/optimal.yaml
# ========================================

示例 2：机器人抓取

# 1. 进入抓取目录
$ cd ../grasp

# 2. 修改配置
$ vi configs/optimal.yaml
# 修改：
#   skill: "grasp"
#   type: "reinforcement_learning"
#   pretrained: true

# 3. 运行示例
$ python run_grasp.py

# 输出：
# ========================================
# Grasp Policy (Ascend NPU)
# ========================================
# Success rate: 95.2% (test set)
# Inference time: 8ms (target: <10ms)
# 
# Config: configs/optimal.yaml
# ========================================

示例 3：机器人操纵

# 1. 进入操纵目录
$ cd ../manipulate

# 2. 修改配置
$ vi configs/optimal.yaml
# 修改：
#   skill: "manipulate"
#   type: "imitation_learning"
#   pretrained: true

# 3. 运行示例
$ python run_manipulate.py

# 输出：
# ========================================
# Manipulate Policy (Ascend NPU)
# ========================================
# Success rate: 88.7% (test set)
# Inference time: 15ms (target: <20ms)
# 
# Config: configs/optimal.yaml
# ========================================

示例 4：人机交互

# 1. 进入交互目录
$ cd ../interact

# 2. 修改配置
$ vi configs/optimal.yaml
# 修改：
#   skill: "interact"
#   type: "transfer_learning"
#   pretrained: true

# 3. 运行示例
$ python run_interact.py

# 输出：
# ========================================
# Interact Policy (Ascend NPU)
# ========================================
# Success rate: 90.1% (test set)
# Inference time: 10ms (target: <12ms)
# 
# Config: configs/optimal.yaml
# ========================================

性能数据

用 skills 的效率提升：

技能	自己写	用技能库	提升
导航	1 月	10 分钟	432x
抓取	1 月	10 分钟	432x
操纵	1.5 月	15 分钟	432x
交互	1 月	10 分钟	432x

提升：~432x

你说气人不气人，之前写 1 月，现在 10 分钟。

跟其他仓库的关系

skills 在 CANN 架构里属于第 2 层（昇腾计算服务层），是具身智能技能库。

依赖关系：

skills（具身智能技能库）
    ↑ 使用
cann-recipes-embodied-intelligence（具身智能最佳实践）
    ↑ 调用
昇腾 NPU

解释一下：

skills：具身智能技能库（导航/抓取/操纵/交互）
cann-recipes-embodied-intelligence：具身智能最佳实践
昇腾 NPU：硬件

简单说：skills 是具身智能的"技能库"。想做机器人学习，就用它。

skills 的核心内容

1. 技能

# 支持的技能
skills/navigate/       # 导航
skills/grasp/          # 抓取
skills/manipulate/     # 操纵
skills/interact/       # 交互

2. 配置

# configs/optimal.yaml
skill:
  name: "..."
  type: "..."

performance:
  use_npu: true
  num_threads: ...
  batch_size: ...

accuracy:
  pretrained: ...
  precision: "..."
  tolerance: ...

3. 脚本

# scripts/run_grasp.py
def main():
    # 加载配置
    # 加载技能
    # 运行
    # 输出结果
    # 验证精度

4. 最佳实践

# best_practices/
rl/              # 强化学习
il/              # 模仿学习
tl/              # 迁移学习

适用场景

什么情况下用 skills：

具身智能：机器人学习
强化学习：要 RL 技能
模仿学习：要 IL 技能
迁移学习：要 TL 技能

什么情况下不用：

纯视觉：用 cann-recipes-spatial-intelligence
自定义技能：自己写

总结

skills 就是昇腾的"具身智能技能库"：

导航：机器人导航
抓取：机器人抓取
操纵：机器人操纵
交互：人机交互

人工智能6S服务平台

作为“人工智能6S店”的官方数字引擎，为AI开发者与企业提供一个覆盖软硬件全栈、一站式门户。

更多推荐

cover

torchtitan-npu——在昇腾NPU上跑通PyTorch原生训练

人工智能6S服务平台

cover

HarmonyOS 鸿蒙PC三方库移植：vcpkg方式的 Port 脚本编写简明教程

人工智能6S服务平台

cover

ops-transformer 快速上手：FlashAttention

人工智能6S服务平台

所有评论(0)

查看更多评论

ff07

已为社区贡献4条内容