Skill Discovery | A Summary of Classic Work on Unsupervised Skill Discovery

Contents

  • [🐱 Unsupervised](#🐱 Unsupervised)
    • [Diversity is All You Need: Learning Skills without a Reward Function (diayn)](#Diversity is All You Need: Learning Skills without a Reward Function (diayn))
    • [Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills (EDL)](#Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills (EDL))
    • [CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery](#CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery)
    • [Lipschitz-constrained Unsupervised Skill Discovery (LSD)](#Lipschitz-constrained Unsupervised Skill Discovery (LSD))
    • [Controllability-Aware Unsupervised Skill Discovery (CSD)](#Controllability-Aware Unsupervised Skill Discovery (CSD))
    • [METRA: Scalable Unsupervised RL with Metric-Aware Abstraction](#METRA: Scalable Unsupervised RL with Metric-Aware Abstraction)
    • [Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning (csf)](#Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning (csf))
    • [Foundation Policies with Hilbert Representations (HILP, offline METRA)](#Foundation Policies with Hilbert Representations (HILP, offline METRA))
    • [Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (DUSDi)](#Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (DUSDi))
    • [SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions](#SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions)
    • [Efficient Skill Discovery via Regret-Aware Optimization](#Efficient Skill Discovery via Regret-Aware Optimization)
  • [🦜 Guided](#🦜 Guided)
    • [Safety-Aware Unsupervised Skill Discovery](#Safety-Aware Unsupervised Skill Discovery)
    • [Do's and Don'ts: Learning Desirable Skills with Instruction Videos (dodont)](#Do's and Don'ts: Learning Desirable Skills with Instruction Videos (dodont))
    • [Language Guided Skill Discovery (LGSD)](#Language Guided Skill Discovery (LGSD))
    • [Reference Guided Skill Discovery (RGSD)](#Reference Guided Skill Discovery (RGSD))
    • [Controlled Diversity with Preference: Towards Learning a Diverse Set of Desired Skills (CDP)](#Controlled Diversity with Preference: Towards Learning a Diverse Set of Desired Skills (CDP))
    • [Human-Aligned Skill Discovery Balancing Behaviour Exploration and Alignment (HaSD)](#Human-Aligned Skill Discovery Balancing Behaviour Exploration and Alignment (HaSD))
    • [Guiding Skill Discovery with Foundation Models (fog)](#Guiding Skill Discovery with Foundation Models (fog))

🐱 Unsupervised

Diversity is All You Need: Learning Skills without a Reward Function (diayn)

Explore, Discover and Learn: Unsupervised Discovery of State-Covering Skills (EDL)

CIC: Contrastive Intrinsic Control for Unsupervised Skill Discovery

Lipschitz-constrained Unsupervised Skill Discovery (LSD)

Controllability-Aware Unsupervised Skill Discovery (CSD)

METRA: Scalable Unsupervised RL with Metric-Aware Abstraction

Can a MISL Fly? Analysis and Ingredients for Mutual Information Skill Learning (csf)

Foundation Policies with Hilbert Representations (HILP, offline METRA)

Disentangled Unsupervised Skill Discovery for Efficient Hierarchical Reinforcement Learning (DUSDi)

SkiLD: Unsupervised Skill Discovery Guided by Factor Interactions

Efficient Skill Discovery via Regret-Aware Optimization

🦜 Guided

Safety-Aware Unsupervised Skill Discovery

Do's and Don'ts: Learning Desirable Skills with Instruction Videos (dodont)

Language Guided Skill Discovery (LGSD)

Reference Guided Skill Discovery (RGSD)

Controlled Diversity with Preference: Towards Learning a Diverse Set of Desired Skills (CDP)

Human-Aligned Skill Discovery Balancing Behaviour Exploration and Alignment (HaSD)

Guiding Skill Discovery with Foundation Models (fog)
