1 2 3 4 5 6 7 作者:李晓辉 联系方式: 1. 微信:Lxh_Chat 2. 邮箱:939958092@qq.com
实验目标:将serverc上rbdtest池中的lxhrbdname镜像和serverf的rbdtest池中的同名镜像保持同步
创建RBD源池 创建一个名为rbdtest的池,稍后将会和第二个集群进行池同步
1 2 3 4 5 [root@serverc ~]# ceph osd pool create rbdtest pool 'rbdtest' created [root@serverc ~]# ceph osd pool application enable rbdtest rbd enabled application 'rbd' on pool 'rbdtest' [root@serverc ~]# rbd pool init -p rbdtest
创建备份集群目标RBD池 请参考ceph部署完成第二套集群部署
创建第二套集群中同名池为rbdtest的rbd池
1 2 3 4 5 [root@serverf ~]# ceph osd pool create rbdtest pool 'rbdtest' created [root@serverf ~]# ceph osd pool application enable rbdtest rbd enabled application 'rbd' on pool 'rbdtest' [root@server ~]# rbd pool init -p rbdtest
创建源镜像 在源集群的rbdtest池中创建一个名为lxhrbdname的镜像,大小为1000M,启用 exclusive-lock
和 journaling
RBD 镜像功能
注意到mirroring state: disabled
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 [root@serverc ~]# rbd create --size 1000M -p rbdtest lxhrbdname \ --image-feature=exclusive-lock,journaling [root@serverc ~]# rbd info -p rbdtest lxhrbdname rbd image 'lxhrbdname' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : dbc45a6069eb block_name_prefix: rbd_data.dbc45a6069eb format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Wed Aug 7 05:22:20 2024 access_timestamp: Wed Aug 7 05:22:20 2024 modify_timestamp: Wed Aug 7 05:22:20 2024 journal: dbc45a6069eb mirroring state: disabled
启用池模式mirror 在源集群中的rbdtest池上启用mirror同步,注意info中的同步信息
注意到mirroring state: enabled
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 [root@serverc ~]# rbd mirror pool enable rbdtest pool [root@serverc ~]# rbd info -p rbdtest lxhrbdname rbd image 'lxhrbdname' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : dbc45a6069eb block_name_prefix: rbd_data.dbc45a6069eb format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Wed Aug 7 05:22:20 2024 access_timestamp: Wed Aug 7 05:22:20 2024 modify_timestamp: Wed Aug 7 05:22:20 2024 journal: dbc45a6069eb mirroring state: enabled mirroring mode: journal mirroring global id : d91f0fe4-4f27-4df2-88b1-38e3ffe9b177 mirroring primary: true
准备源同步token 在源集群中生成同步的key,将源集群的站点命名为site1,保存好,稍后用于复制到第二套集群
1 2 3 4 [root@serverc ~]# rbd mirror pool peer bootstrap create \ --site-name site1 rbdtest > site1.key [root@serverc ~]# cat site1.key eyJmc2lkIjoiMmFlNmQwNWEtMjI5YS0xMWVjLTky...
1 2 [root@serverc ~]# scp -p site1.key root@serverf:/root site1.key 100% 253 142.4KB/s 00:00
目标集群部署rbd-mirror进程
1 2 3 4 [root@serverf ~]# ceph orch apply rbd-mirror --placement=serverf.lab.example.com Scheduled rbd-mirror update... [root@serverf ~]# ceph orch ps | grep mirror rbd-mirror.serverf.veenfh serverf.lab.example.com running (5s) 1s ago 5s - 16.2.0-117.el8cp 2142b60d7974 c9b6b8ff89e8
导入源集群token密钥 导入源集群的key,并命名本地集群为backupceph,从info信息中看到mirroring primary: false
忽略包含以下文本的已知错误:auth: unable to find a keyring on …
import这一步执行成功后,在备份集群就已经可以看到源镜像了
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 [root@serverf ~]# rbd mirror pool peer bootstrap import \ --site-name backupceph --direction rx-only rbdtest /root/site1.key [root@serverf ~]# [root@cephbackup ~]# rbd -p rbdtest ls lxhrbdname [root@serverf ~]# rbd -p rbdtest info lxhrbdname rbd image 'lxhrbdname' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : 37a88b1c5510 block_name_prefix: rbd_data.37a88b1c5510 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Sun Sep 18 15:18:32 2022 access_timestamp: Sun Sep 18 15:18:32 2022 modify_timestamp: Sun Sep 18 15:18:32 2022 journal: 37a88b1c5510 mirroring state: enabled mirroring mode: journal mirroring global id : 165a0e83-8e8b-43cb-8d93-6b1812e3c58c mirroring primary: false
查询同步状态 在两台机器上,分别执行这个命令,可以看到两个站点的信息
1 2 3 4 5 6 7 8 9 10 [root@serverf ~]# rbd mirror pool info rbdtest Mode: pool Site Name: backupceph Peer Sites: UUID: 6b9fb207-4555-4637-8954-f637453081c9 Name: site1 Direction: rx-only Client: client.rbd-mirror-peer
在备份的集群中,查询守护进程和镜像的健康状态
1 2 3 4 5 6 [root@serverf ~]# rbd mirror pool status -p rbdtest health: OK daemon health: OK image health: OK images: 1 total 1 replaying
再次到源集群去看看镜像状态
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 [root@serverc ~]# rbd -p rbdtest info lxhrbdname rbd image 'lxhrbdname' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : d3d46acc6e3f block_name_prefix: rbd_data.d3d46acc6e3f format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:12:16 2023 access_timestamp: Fri Sep 8 12:12:16 2023 modify_timestamp: Fri Sep 8 12:12:16 2023 journal: d3d46acc6e3f mirroring state: enabled mirroring mode: journal mirroring global id : 57379079-48cc-4091-b08c-01e087562d9b mirroring primary: true
再次到源集群去看看池的同步状态
1 2 3 4 5 6 7 8 9 10 [root@serverc ~]# rbd mirror pool info rbdtest Mode: pool Site Name: site1 Peer Sites: UUID: e060d31c-94c7-495d-8612-4ae890e0a4e7 Name: backupceph Mirror UUID: eac9e8a9-b13c-46f3-9997-6c22555b77b7 Direction: tx-only
创建新镜像测试是否同步 重新在源集群的池中创建一个镜像,稍后再去第二套集群看看是否同步成功
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 [root@serverc ~]# rbd create --size 1000M -p rbdtest image2 \ > --image-feature=exclusive-lock,journaling [root@serverc ~]# rbd info image2 -p rbdtest rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : d420d153d504 block_name_prefix: rbd_data.d420d153d504 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:23:23 2023 access_timestamp: Fri Sep 8 12:23:23 2023 modify_timestamp: Fri Sep 8 12:23:23 2023 journal: d420d153d504 mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: true
在第二套集群中查看,发现已经同步成功
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 [root@serverf ~]# rbd -p rbdtest ls image2 lxhrbdname [root@serverf ~]# rbd -p rbdtest info image2 rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : 8575f69a1f block_name_prefix: rbd_data.8575f69a1f format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:23:25 2023 access_timestamp: Fri Sep 8 12:23:25 2023 modify_timestamp: Fri Sep 8 12:23:25 2023 journal: 8575f69a1f mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: false
测试删除镜像的同步 这个删除,只能从源集群删除
1 2 3 4 5 6 7 [root@serverc ~]# rbd ls -p rbdtest image2 lxhrbdname [root@serverc ~]# rbd rm lxhrbdname -p rbdtest Removing image: 100% complete...done. [root@serverc ~]# rbd ls -p rbdtest image2
到备份集群看看删除成功没
1 2 [root@serverf ~]# rbd ls -p rbdtest image2
一些额外的mirror命令:
如果两个peer集群之间的状态不一致,rbd-mirror守护进程不会尝试mirror不一致的映像,在备份集群上 使用rbd mirror image resync重新同步映像
1 2 [root@serverf ~]# rbd mirror image resync rbdtest/image2 Flagged image for resync from primary
使用rbd mirror image enable或rbd mirror image disable在两个对端存储集群上的整个池image模式中启用或禁用mirroring模式
1 2 rbd mirror image enable mypool/myimage rbd mirror image disable mypool/myimage
使用基于快照的镜像,通过禁用镜像和启用快照,将基于日志的镜像转换为基于快照的镜像
1 2 rbd mirror image disable mypool/myimage rbd mirror image enable mypool/myimage snapshot
故障转移过程 如果主要 RBD 镜像变得不可用,您可按照步骤启动对次要 RBD 镜像的访问:
停止访问主要 RBD 镜像。也就是说,停止正在使用该镜像的所有应用和虚拟机。
使用 rbd mirror image demote rbdtest/image2
命令来降级主要 RBD 镜像。
使用 rbd mirror image promote rbdtest/image2
命令来升级次要 RBD 镜像。
恢复访问 RBD 镜像。重新启动应用和虚拟机。
在非有序关机后发生故障转移时,您必须从备份存储集群中的 Ceph 监控器节点升级非主要镜像。使用 --force
选项,因为降级无法传播到主要存储集群
降级集群镜像 降级操作必须是源集群执行,除非源集群不存在,可以尝试用–force
我们发现降级后,serverc就不再是primary了
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 41 42 [root@serverc ~]# rbd info rbdtest/image2 rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : d420d153d504 block_name_prefix: rbd_data.d420d153d504 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:23:23 2023 access_timestamp: Fri Sep 8 12:23:23 2023 modify_timestamp: Fri Sep 8 12:23:23 2023 journal: d420d153d504 mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: true [root@serverc ~]# rbd mirror image demote rbdtest/image2 Image demoted to non-primary [root@serverc ~]# rbd info rbdtest/image2 rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : d420d153d504 block_name_prefix: rbd_data.d420d153d504 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:23:23 2023 access_timestamp: Fri Sep 8 12:23:23 2023 modify_timestamp: Fri Sep 8 12:23:23 2023 journal: d420d153d504 mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: false
提升集群镜像 在备份集群提升了镜像后,serverf变成了primary
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 29 30 31 32 33 34 35 36 37 38 39 40 [root@serverf ~]# rbd info rbdtest/image2 rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : 85757ad71c90 block_name_prefix: rbd_data.85757ad71c90 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:32:14 2023 access_timestamp: Fri Sep 8 12:32:14 2023 modify_timestamp: Fri Sep 8 12:32:14 2023 journal: 85757ad71c90 mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: false [root@serverf ~]# rbd mirror image promote rbdtest/image2 Image promoted to primary [root@serverf ~]# rbd info rbdtest/image2 rbd image 'image2' : size 1000 MiB in 250 objects order 22 (4 MiB objects) snapshot_count: 0 id : 85757ad71c90 block_name_prefix: rbd_data.85757ad71c90 format: 2 features: exclusive-lock, journaling op_features: flags: create_timestamp: Fri Sep 8 12:32:14 2023 access_timestamp: Fri Sep 8 12:32:14 2023 modify_timestamp: Fri Sep 8 12:32:14 2023 journal: 85757ad71c90 mirroring state: enabled mirroring mode: journal mirroring global id : 644301a8-22b5-46e8-9027-103ecfbc7f74 mirroring primary: true