RocketMQ 消息存储机制分析

3年前 (2022) 程序员胖胖胖虎阿

330 0 0

🍊 Java学习：Java从入门到精通总结

🍊 深入浅出RocketMQ设计思想：深入浅出RocketMQ设计思想

🍊 绝对不一样的职场干货：大厂最佳实践经验指南

📆 最近更新：2022年8月26日

🍊 个人简介：通信工程本硕💪、Java程序员🌕。做过科研paper，发过专利，优秀的程序员不应该只是CRUD

🍊 点赞 👍 收藏 ⭐留言 📝 都是我最大的动力！

作为一个消息中间件，消息存储的效率直接影响到消息存取的效率，RocketMQ的单机吞吐量达到10w级别也和其存储设计有关，文本就对其进行一些探索。

文章目录

RocketMQ整体存储架构
- 消息生产与消费消息互相隔离
- CommitLog文件采用混合型存储
- 顺序读写
mmap内存映射技术
- 技术特点
- `MappedByteBuffer` 分析
- 使用`mmap`的注意点
操作系统PageCache机制
- RocketMQ里的实现方式

RocketMQ整体存储架构

RocketMQ 消息存储机制分析

消息生产与消费消息互相隔离

Producer发送的消息最终写入的是CommitLog文件，Consumer首先从ConsumerQueue里读取持久化消息的offset，消息大小、消息tag属性的Hash值，之后再从CommitLog中读取消息的真正内容

CommitLog文件采用混合型存储

所有Topic下的消息队列公用同一个CommitLog，并通过建立类似索引的方式来区分不同Topic下的不同MessageQueue的消息。

此外，只有异步线程通过doDispatch方法异步生成了ConsumerQueue里的元素后，Consumer才能进行消息的消费。

由此看来，只要消息写入到CommitLog之后，即使ConsumerQueue里的消息丢失了，也可以通过CommitLog来恢复。

顺序读写

发送消息时，Producer发送的消息是按照顺序写入CommitLog的；消费消息时，Consumer也是顺序从ConsumerQueue里读取消息的。

从CommitLog文件中读取数据时是随机读取，根据消息在CommitLog文件中的起始offset来读取消息的内容

在RocketMQ集群的并发量很高的情况下，文件的随机IO开销还是很大的，RocketMQ会使用其他的手段来避免这个问题，将在后面分析~

总结一下，RocketMQ的存储架构设计的优缺点如下：

优点： ConsumerQueue消息逻辑队列比较轻量级；串行访问磁盘避免了磁盘竞争，避免了因为队列个数的增大而导致IO等待时间增大
缺点： CommitLog是随机读取；Consumer如果想要消费一条消息的话，需要先读ConsumerQueue，再读CommitLog，额外增加了一次开销

mmap内存映射技术

mmap和write/read一样需要从PageCache中刷盘，但mmap可以直接将PageCache刷到硬盘上而不需要经过内核态，减少了1次数据复制的过程。

RocketMQ 消息存储机制分析

技术特点

mmap的特点就是他不用像普通IO操作那样将文件中的数据先拷贝到操作系统的内核IO缓冲区，而是直接将客户端进程的私有地址空间中的一块区域和文件对象建立映射关系。

这样一来，程序就像直接从内存中完成对文件的读写操作一样。

当缺页中断发生时，直接将文件从磁盘拷贝到客户端的进程空间内只需要进行一次数据拷贝。对于大文件来说

`MappedByteBuffer` 分析

JDK中的源码如下：

public abstract class MappedByteBuffer extends ByteBuffer{
    private final FileDescriptor fd;

    // This should only be invoked by the DirectByteBuffer constructors
    //
    MappedByteBuffer(int mark, int pos, int lim, int cap, // package-private
                     FileDescriptor fd)
    {
        super(mark, pos, lim, cap);
        this.fd = fd;
    }

    MappedByteBuffer(int mark, int pos, int lim, int cap) { // package-private
        super(mark, pos, lim, cap);
        this.fd = null;
    }

    private void checkMapped() {
        if (fd == null)
            // Can only happen if a luser explicitly casts a direct byte buffer
            throw new UnsupportedOperationException();
    }

    // Returns the distance (in bytes) of the buffer from the page aligned address
    // of the mapping. Computed each time to avoid storing in every direct buffer.
    private long mappingOffset() {
        int ps = Bits.pageSize();
        long offset = address % ps;
        return (offset >= 0) ? offset : (ps + offset);
    }

    private long mappingAddress(long mappingOffset) {
        return address - mappingOffset;
    }

    private long mappingLength(long mappingOffset) {
        return (long)capacity() + mappingOffset;
    }

    public final boolean isLoaded() {
        checkMapped();
        if ((address == 0) || (capacity() == 0))
            return true;
        long offset = mappingOffset();
        long length = mappingLength(offset);
        return isLoaded0(mappingAddress(offset), length, Bits.pageCount(length));
    }

    // not used, but a potential target for a store, see load() for details.
    private static byte unused;

    public final MappedByteBuffer load() {
        checkMapped();
        if ((address == 0) || (capacity() == 0))
            return this;
        long offset = mappingOffset();
        long length = mappingLength(offset);
        load0(mappingAddress(offset), length);

        // Read a byte from each page to bring it into memory. A checksum
        // is computed as we go along to prevent the compiler from otherwise
        // considering the loop as dead code.
        Unsafe unsafe = Unsafe.getUnsafe();
        int ps = Bits.pageSize();
        int count = Bits.pageCount(length);
        long a = mappingAddress(offset);
        byte x = 0;
        for (int i=0; i<count; i++) {
            x ^= unsafe.getByte(a);
            a += ps;
        }
        if (unused != 0)
            unused = x;

        return this;
    }

    public final MappedByteBuffer force() {
        checkMapped();
        if ((address != 0) && (capacity() != 0)) {
            long offset = mappingOffset();
            force0(fd, mappingAddress(offset), mappingLength(offset));
        }
        return this;
    }

    private native boolean isLoaded0(long address, long length, int pageCount);
    private native void load0(long address, long length);
    private native void force0(FileDescriptor fd, long address, long length);
}

可以看出， MappedByteBuffer继承了ByteBuffer，ByteBuffer内部维护了一个变量address表示逻辑地址。在建立映射关系时，使用FileChannel类下的map()方法把文件对象映射到虚拟内存
RocketMQ 消息存储机制分析
map()方法底层调用了本地方法来完成文件的映射操作；get()方法是通过底层以地址 + 偏移量的方式来获取指定映射到内存中的数据。