封装红黑树实现 mymap 和 myset

文章目录

[封装红黑树实现 mymap 和 myset](#封装红黑树实现 mymap 和 myset)
- 一、源码及框架分析
- - [1.1 框架核心代码](#1.1 框架核心代码)
  - [1.2 红黑树的泛型设计](#1.2 红黑树的泛型设计)
  - [1.3 为何需要两个模板参数 `Key` 和 `Value`？](#1.3 为何需要两个模板参数 Key 和 Value？)
- [二、模拟实现 map 和 set](#二、模拟实现 map 和 set)
- - [2.1 实现红黑树框架，支持插入](#2.1 实现红黑树框架，支持插入)
  - - RBTree.h
    - Mymap.h
    - Myset.h
  - [2.2 支持迭代器](#2.2 支持迭代器)
  - - 迭代器实现思路
    - 迭代器代码
    - [在 RBTree 中集成迭代器](#在 RBTree 中集成迭代器)
  - [2.3 支持 `operator\[\]`](#2.3 支持 operator[])
  - [2.4 完整代码](#2.4 完整代码)
  - - [最终版 `Myset.h`](#最终版 Myset.h)
    - [最终版 `Mymap.h`](#最终版 Mymap.h)
    - [最终版 `RBTree.h`](#最终版 RBTree.h)
- 三、总结

封装红黑树实现 mymap 和 myset

一、源码及框架分析

在 SGI-STL 30 版本中，map 和 set 的实现巧妙地复用了同一棵红黑树（rb_tree）。其核心代码主要位于 stl_tree.h、stl_map.h 和 stl_set.h 中。

1.1 框架核心代码

以下是 set 和 map 的简化定义：

cpp 复制代码

// stl_set.h
template <class Key, class Compare = less<Key>, class Alloc = alloc>
class set {
public:
    typedef Key key_type;
    typedef Key value_type;
private:
    typedef rb_tree<key_type, value_type,
                    identity<value_type>, key_compare, Alloc> rep_type;
    rep_type t; // 底层红黑树
};

// stl_map.h
template <class Key, class T, class Compare = less<Key>, class Alloc = alloc>
class map {
public:
    typedef Key key_type;
    typedef T mapped_type;
    typedef pair<const Key, T> value_type;
private:
    typedef rb_tree<key_type, value_type,
                    select1st<value_type>, key_compare, Alloc> rep_type;
    rep_type t; // 底层红黑树
};

1.2 红黑树的泛型设计

rb_tree 通过模板参数实现泛型，使其既能用于 set（存储 Key），也能用于 map（存储 pair<const Key, T>）。其结点定义如下：

cpp 复制代码

template <class Value>
struct __rb_tree_node : public __rb_tree_node_base {
    typedef __rb_tree_node<Value>* link_type;
    Value value_field; // 存储的实际数据
};

rb_tree 的模板参数：

cpp 复制代码

template <class Key, class Value, class KeyOfValue, class Compare, class Alloc = alloc>
class rb_tree {
    // ...
};

Key ：键的类型，用于 find、erase 等接口的参数类型。
Value ：结点中存储的数据类型，在 set 中为 Key，在 map 中为 pair<const Key, T>。
KeyOfValue ：仿函数，用于从 Value 中提取 Key，因为红黑树在比较时只比较键。

1.3 为何需要两个模板参数 `Key` 和 `Value`？

set 的两个模板参数相同，map 则不同。这是因为：

Value 决定了结点存储的内容。
Key 决定了 find、erase 等函数接受的参数类型。

map 中 Value 是 pair<const Key, T>，但查找时只需传入 Key 类型的值，因此两个模板参数缺一不可。

注：源码中命名风格略有混乱，set 使用 Key，map 使用 Key 和 T，rb_tree 又使用 Key 和 Value，但设计思路清晰。

二、模拟实现 map 和 set

接下来，我们将基于自己实现的红黑树，封装出 map 和 set。

2.1 实现红黑树框架，支持插入

我们首先实现一个红黑树 RBTree，它通过 KeyOfT 仿函数提取键值进行比较。

RBTree.h

cpp 复制代码

enum Colour { RED, BLACK };

template<class T>
struct RBTreeNode {
    T _data;
    RBTreeNode<T>* _left;
    RBTreeNode<T>* _right;
    RBTreeNode<T>* _parent;
    Colour _col;

    RBTreeNode(const T& data)
        : _data(data), _left(nullptr), _right(nullptr), _parent(nullptr), _col(RED) {}
};

template<class K, class T, class KeyOfT>
class RBTree {
    typedef RBTreeNode<T> Node;
public:
    bool Insert(const T& data) {
        if (_root == nullptr) {
            _root = new Node(data);
            _root->_col = BLACK;
            return true;
        }

        KeyOfT kot;
        Node* parent = nullptr;
        Node* cur = _root;
        while (cur) {
            if (kot(cur->_data) < kot(data)) {
                parent = cur;
                cur = cur->_right;
            } else if (kot(cur->_data) > kot(data)) {
                parent = cur;
                cur = cur->_left;
            } else {
                return false; // 键已存在
            }
        }

        cur = new Node(data);
        Node* newnode = cur;
        cur->_col = RED;

        if (kot(parent->_data) < kot(data))
            parent->_right = cur;
        else
            parent->_left = cur;
        cur->_parent = parent;

        // 后续平衡处理（旋转、变色）省略，完整代码见文末

        return true;
    }
private:
    Node* _root = nullptr;
};

Mymap.h

cpp 复制代码

namespace bit {
    template<class K, class V>
    class map {
        struct MapKeyOfT {
            const K& operator()(const pair<K, V>& kv) {
                return kv.first;
            }
        };
    public:
        bool insert(const pair<K, V>& kv) {
            return _t.Insert(kv);
        }
    private:
        RBTree<K, pair<K, V>, MapKeyOfT> _t;
    };
}

Myset.h

cpp 复制代码

namespace bit {
    template<class K>
    class set {
        struct SetKeyOfT {
            const K& operator()(const K& key) {
                return key;
            }
        };
    public:
        bool insert(const K& key) {
            return _t.Insert(key);
        }
    private:
        RBTree<K, K, SetKeyOfT> _t;
    };
}

2.2 支持迭代器

迭代器的核心是 operator++ 和 operator--，实现中序遍历的步进逻辑。

迭代器实现思路

begin()：返回中序第一个结点（最左结点）。
end() ：返回 nullptr（或源码中的哨兵头结点）。
operator++() ：
- 若右子树非空，找右子树的最左结点。
- 若右子树为空，向上找第一个"当前结点是左孩子"的祖先结点。
operator--() ：逻辑与 ++ 对称，反向遍历。

迭代器代码

cpp 复制代码

template<class T, class Ref, class Ptr>
struct RBTreeIterator {
    typedef RBTreeNode<T> Node;
    typedef RBTreeIterator<T, Ref, Ptr> Self;

    Node* _node;
    Node* _root; // 用于处理 end() 的情况

    RBTreeIterator(Node* node, Node* root) : _node(node), _root(root) {}

    Ref operator*() { return _node->_data; }
    Ptr operator->() { return &_node->_data; }

    Self& operator++() {
        if (_node->_right) {
            Node* leftMost = _node->_right;
            while (leftMost->_left) leftMost = leftMost->_left;
            _node = leftMost;
        } else {
            Node* cur = _node;
            Node* parent = cur->_parent;
            while (parent && cur == parent->_right) {
                cur = parent;
                parent = cur->_parent;
            }
            _node = parent;
        }
        return *this;
    }

    Self& operator--() {
        if (_node == nullptr) { // 处理 --end()
            Node* rightMost = _root;
            while (rightMost && rightMost->_right) rightMost = rightMost->_right;
            _node = rightMost;
        } else if (_node->_left) {
            Node* rightMost = _node->_left;
            while (rightMost->_right) rightMost = rightMost->_right;
            _node = rightMost;
        } else {
            Node* cur = _node;
            Node* parent = cur->_parent;
            while (parent && cur == parent->_left) {
                cur = parent;
                parent = cur->_parent;
            }
            _node = parent;
        }
        return *this;
    }

    bool operator!=(const Self& s) const { return _node != s._node; }
    bool operator==(const Self& s) const { return _node == s._node; }
};

在 RBTree 中集成迭代器

cpp 复制代码

template<class K, class T, class KeyOfT>
class RBTree {
public:
    typedef RBTreeIterator<T, T&, T*> Iterator;
    typedef RBTreeIterator<T, const T&, const T*> ConstIterator;

    Iterator Begin() {
        Node* leftMost = _root;
        while (leftMost && leftMost->_left) leftMost = leftMost->_left;
        return Iterator(leftMost, _root);
    }

    Iterator End() { return Iterator(nullptr, _root); }

    // 同理实现 ConstIterator
private:
    Node* _root = nullptr;
};

2.3 支持 `operator[]`

map 的 [] 需要借助 insert 的返回值实现。因此，RBTree::Insert 需返回 pair<Iterator, bool>。

cpp 复制代码

pair<Iterator, bool> Insert(const T& data) {
    // 插入逻辑，返回插入位置的迭代器及是否成功
}

然后在 map 中：

cpp 复制代码

V& operator[](const K& key) {
    pair<iterator, bool> ret = insert(make_pair(key, V()));
    return ret.first->second;
}

2.4 完整代码

最终版 `Myset.h`

cpp 复制代码

#include "RBTree.h"

namespace bit {
    template<class K>
    class set {
        struct SetKeyOfT {
            const K& operator()(const K& key) { return key; }
        };
    public:
        typedef typename RBTree<K, const K, SetKeyOfT>::Iterator iterator;
        typedef typename RBTree<K, const K, SetKeyOfT>::ConstIterator const_iterator;

        iterator begin() { return _t.Begin(); }
        iterator end() { return _t.End(); }
        const_iterator begin() const { return _t.Begin(); }
        const_iterator end() const { return _t.End(); }

        pair<iterator, bool> insert(const K& key) { return _t.Insert(key); }
        iterator find(const K& key) { return _t.Find(key); }

    private:
        RBTree<K, const K, SetKeyOfT> _t;
    };
}

最终版 `Mymap.h`

cpp 复制代码

#include "RBTree.h"

namespace bit {
    template<class K, class V>
    class map {
        struct MapKeyOfT {
            const K& operator()(const pair<K, V>& kv) { return kv.first; }
        };
    public:
        typedef typename RBTree<K, pair<const K, V>, MapKeyOfT>::Iterator iterator;
        typedef typename RBTree<K, pair<const K, V>, MapKeyOfT>::ConstIterator const_iterator;

        iterator begin() { return _t.Begin(); }
        iterator end() { return _t.End(); }
        const_iterator begin() const { return _t.Begin(); }
        const_iterator end() const { return _t.End(); }

        pair<iterator, bool> insert(const pair<K, V>& kv) { return _t.Insert(kv); }
        iterator find(const K& key) { return _t.Find(key); }

        V& operator[](const K& key) {
            pair<iterator, bool> ret = insert(make_pair(key, V()));
            return ret.first->second;
        }

    private:
        RBTree<K, pair<const K, V>, MapKeyOfT> _t;
    };
}

最终版 `RBTree.h`

完整代码请参考文末总结部分，或结合上述片段整合。核心包括：

结点定义
插入与平衡（旋转、变色）
迭代器实现
Find、Begin、End 等接口

三、总结

通过封装红黑树实现 map 和 set，我们深入理解了 STL 中容器复用的设计思想：

泛型设计 ：红黑树通过 Value 模板参数决定存储内容，通过 KeyOfT 仿函数提取键值进行比较。
迭代器实现：中序遍历的步进逻辑是迭代器实现的核心，需同时处理左右子树与祖先关系。
map 的 [] 实现 ：依赖于 insert 的返回值，简洁高效。
权限控制 ：set 的迭代器不允许修改键值，通过将 Value 模板参数设为 const K 实现；map 则通过 pair<const K, V> 保护键不被修改。

这种复用方式不仅减少了代码量，还体现了面向对象与泛型编程的强大结合。

封装红黑树实现 mymap 和 myset

文章目录