linux

History

Filipe Manana 09c826447c btrfs: fix filesystem corruption after a device replace commit `4c8f353272` upstream. We use a device's allocation state tree to track ranges in a device used for allocated chunks, and we set ranges in this tree when allocating a new chunk. However after a device replace operation, we were not setting the allocated ranges in the new device's allocation state tree, so that tree is empty after a device replace. This means that a fitrim operation after a device replace will trim the device ranges that have allocated chunks and extents, as we trim every range for which there is not a range marked in the device's allocation state tree. It is also important during chunk allocation, since the device's allocation state is used to determine if a range is already allocated when allocating a new chunk. This is trivial to reproduce and the following script triggers the bug: $ cat reproducer.sh #!/bin/bash DEV1="/dev/sdg" DEV2="/dev/sdh" DEV3="/dev/sdi" wipefs -a $DEV1 $DEV2 $DEV3 &> /dev/null # Create a raid1 test fs on 2 devices. mkfs.btrfs -f -m raid1 -d raid1 $DEV1 $DEV2 > /dev/null mount $DEV1 /mnt/btrfs xfs_io -f -c "pwrite -S 0xab 0 10M" /mnt/btrfs/foo echo "Starting to replace $DEV1 with $DEV3" btrfs replace start -B $DEV1 $DEV3 /mnt/btrfs echo echo "Running fstrim" fstrim /mnt/btrfs echo echo "Unmounting filesystem" umount /mnt/btrfs echo "Mounting filesystem in degraded mode using $DEV3 only" wipefs -a $DEV1 $DEV2 &> /dev/null mount -o degraded $DEV3 /mnt/btrfs if [ $? -ne 0 ]; then dmesg \| tail echo echo "Failed to mount in degraded mode" exit 1 fi echo echo "File foo data (expected all bytes = 0xab):" od -A d -t x1 /mnt/btrfs/foo umount /mnt/btrfs When running the reproducer: $ ./replace-test.sh wrote 10485760/10485760 bytes at offset 0 10 MiB, 2560 ops; 0.0901 sec (110.877 MiB/sec and 28384.5216 ops/sec) Starting to replace /dev/sdg with /dev/sdi Running fstrim Unmounting filesystem Mounting filesystem in degraded mode using /dev/sdi only mount: /mnt/btrfs: wrong fs type, bad option, bad superblock on /dev/sdi, missing codepage or helper program, or other error. [19581.748641] BTRFS info (device sdg): dev_replace from /dev/sdg (devid 1) to /dev/sdi started [19581.803842] BTRFS info (device sdg): dev_replace from /dev/sdg (devid 1) to /dev/sdi finished [19582.208293] BTRFS info (device sdi): allowing degraded mounts [19582.208298] BTRFS info (device sdi): disk space caching is enabled [19582.208301] BTRFS info (device sdi): has skinny extents [19582.212853] BTRFS warning (device sdi): devid 2 uuid 1f731f47-e1bb-4f00-bfbb-9e5a0cb4ba9f is missing [19582.213904] btree_readpage_end_io_hook: 25839 callbacks suppressed [19582.213907] BTRFS error (device sdi): bad tree block start, want 30490624 have 0 [19582.214780] BTRFS warning (device sdi): failed to read root (objectid=7): -5 [19582.231576] BTRFS error (device sdi): open_ctree failed Failed to mount in degraded mode So fix by setting all allocated ranges in the replace target device when the replace operation is finishing, when we are holding the chunk mutex and we can not race with new chunk allocations. A test case for fstests follows soon. Fixes: `1c11b63eff` ("btrfs: replace pending/pinned chunks lists with io tree") CC: stable@vger.kernel.org # 5.2+ Reviewed-by: Nikolay Borisov <nborisov@suse.com> Reviewed-by: Johannes Thumshirn <johannes.thumshirn@wdc.com> Signed-off-by: Filipe Manana <fdmanana@suse.com> Signed-off-by: David Sterba <dsterba@suse.com> Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>		2020-10-07 08:01:23 +02:00
..
tests	btrfs: Correctly handle empty trees in find_first_clear_extent_bit	2020-02-11 04:35:34 -08:00
acl.c	btrfs: cleanup btrfs_setxattr_trans and drop transaction parameter	2019-04-29 19:02:44 +02:00
async-thread.c	Btrfs: fix crash during unmount due to race with delayed inode workers	2020-04-17 10:50:15 +02:00
async-thread.h	Btrfs: fix crash during unmount due to race with delayed inode workers	2020-04-17 10:50:15 +02:00
backref.c	btrfs: fix double free on ulist after backref resolution failure	2020-07-29 10:18:30 +02:00
backref.h	btrfs: fiemap: preallocate ulists for btrfs_check_shared	2019-07-01 13:34:53 +02:00
block-group.c	btrfs: add wrapper for transaction abort predicate	2020-08-26 10:40:49 +02:00
block-group.h	btrfs: move struct io_ctl to free-space-cache.h	2019-09-09 14:59:15 +02:00
block-rsv.c	btrfs: force chunk allocation if our global rsv is larger than metadata	2020-06-22 09:31:13 +02:00
block-rsv.h	btrfs: migrate the global_block_rsv helpers to block-rsv.c	2019-07-02 12:30:55 +02:00
btrfs_inode.h	btrfs: remove assumption about csum type form btrfs_print_data_csum_error()	2019-07-01 13:35:02 +02:00
check-integrity.c	btrfs: fix possible NULL-pointer dereference in integrity checks	2020-02-24 08:36:53 +01:00
check-integrity.h
compression.c	btrfs: move cond_wake_up functions out of ctree	2019-09-09 14:59:15 +02:00
compression.h	btrfs: compression: replace set_level callbacks by a common helper	2019-09-09 14:59:11 +02:00
ctree.c	btrfs: set the lockdep class for log tree extent buffers	2020-09-09 19:12:31 +02:00
ctree.h	btrfs: detect nocow for swap after snapshot delete	2020-09-03 11:27:02 +02:00
delalloc-space.c	Btrfs: fix qgroup double free after failure to reserve metadata for delalloc	2019-10-17 20:13:44 +02:00
delalloc-space.h	btrfs: migrate the delalloc space stuff to it's own home	2019-07-04 17:26:17 +02:00
delayed-inode.c	btrfs: add wrapper for transaction abort predicate	2020-08-26 10:40:49 +02:00
delayed-inode.h
delayed-ref.c	Btrfs: fix race between adding and putting tree mod seq elements and nodes	2020-02-11 04:35:34 -08:00
delayed-ref.h	btrfs: migrate the delayed refs rsv code	2019-07-04 17:26:17 +02:00
dev-replace.c	btrfs: fix filesystem corruption after a device replace	2020-10-07 08:01:23 +02:00
dev-replace.h	btrfs: get fs_info from trans in btrfs_run_dev_replace	2019-04-29 19:02:43 +02:00
dir-item.c	btrfs: remove unused parameter fs_info from btrfs_extend_item	2019-04-29 19:02:50 +02:00
disk-io.c	btrfs: fix overflow when copying corrupt csums for a message	2020-10-01 13:18:24 +02:00
disk-io.h	btrfs: Make reada_tree_block_flagged private	2019-09-09 14:59:11 +02:00
export.c	btrfs: export helpers for subvolume name/id resolution	2020-08-26 10:40:49 +02:00
export.h	btrfs: export helpers for subvolume name/id resolution	2020-08-26 10:40:49 +02:00
extent_io.c	btrfs: fix potential deadlock in the search ioctl	2020-09-09 19:12:31 +02:00
extent_io.h	btrfs: fix potential deadlock in the search ioctl	2020-09-09 19:12:31 +02:00
extent_map.c	Btrfs: fix race between using extent maps and merging them	2020-02-19 19:53:00 +01:00
extent_map.h
extent-tree.c	btrfs: don't force read-only after error in drop snapshot	2020-10-01 13:18:04 +02:00
file-item.c	btrfs: do not ignore error from btrfs_next_leaf() when inserting checksums	2020-06-22 09:30:55 +02:00
file.c	btrfs: detect nocow for swap after snapshot delete	2020-09-03 11:27:02 +02:00
free-space-cache.c	btrfs: fix space cache memory leak after transaction abort	2020-09-03 11:27:02 +02:00
free-space-cache.h	btrfs: move struct io_ctl to free-space-cache.h	2019-09-09 14:59:15 +02:00
free-space-tree.c	btrfs: move basic block_group definitions to their own header	2019-09-09 14:59:03 +02:00
free-space-tree.h	btrfs: move basic block_group definitions to their own header	2019-09-09 14:59:03 +02:00
inode-item.c	btrfs: Make btrfs_find_name_in_ext_backref return struct btrfs_inode_extref	2019-09-09 14:59:16 +02:00
inode-map.c	btrfs: qgroup: Always free PREALLOC META reserve in btrfs_delalloc_release_extents()	2019-10-15 18:50:07 +02:00
inode-map.h
inode.c	btrfs: qgroup: fix data leak caused by race between writeback and truncate	2020-10-01 13:18:10 +02:00
ioctl.c	btrfs: fix wrong address when faulting in pages in the search ioctl	2020-09-17 13:47:52 +02:00
Kconfig	btrfs: Fix build error while LIBCRC32C is module	2019-07-17 17:03:30 +02:00
locking.c	btrfs: move cond_wake_up functions out of ctree	2019-09-09 14:59:15 +02:00
locking.h	btrfs: Remove unused locking functions	2019-09-09 14:58:59 +02:00
lzo.c	btrfs: compression: replace set_level callbacks by a common helper	2019-09-09 14:59:11 +02:00
Makefile	btrfs: migrate the block group lookup code	2019-09-09 14:59:04 +02:00
misc.h	btrfs: move math functions to misc.h	2019-09-09 14:59:15 +02:00
ordered-data.c	Btrfs: fix btrfs_wait_ordered_range() so that it waits for all ordered extents	2020-02-28 17:22:24 +01:00
ordered-data.h	btrfs: don't assume ordered sums to be 4 bytes	2019-07-01 13:35:00 +02:00
orphan.c
print-tree.c	btrfs: require only sector size alignment for parent eb bytenr	2020-09-17 13:47:51 +02:00
print-tree.h
props.c	btrfs: rename the btrfs_calc_*_metadata_size helpers	2019-09-09 14:59:13 +02:00
props.h	btrfs: delete unused function btrfs_set_prop_trans	2019-04-29 19:02:54 +02:00
qgroup.c	btrfs: make btrfs_qgroup_check_reserved_leak take btrfs_inode	2020-09-03 11:26:47 +02:00
qgroup.h	btrfs: make btrfs_qgroup_check_reserved_leak take btrfs_inode	2020-09-03 11:26:47 +02:00
raid56.c	btrfs: get rid of unique workqueue helper functions	2020-01-09 10:20:06 +01:00
raid56.h	btrfs: constify map parameter for nr_parity_stripes and nr_data_stripes	2019-07-01 13:34:58 +02:00
rcu-string.h
reada.c	btrfs: get rid of unique workqueue helper functions	2020-01-09 10:20:06 +01:00
ref-verify.c	btrfs: ref-verify: fix memory leak in add_block_entry	2020-08-21 13:05:21 +02:00
ref-verify.h	btrfs: ref-verify: Use btrfs_ref to refactor btrfs_ref_tree_mod()	2019-04-29 19:02:49 +02:00
relocation.c	btrfs: fix setting last_trans for reloc roots	2020-10-01 13:17:55 +02:00
root-tree.c	btrfs: do not delete mismatched root refs	2020-01-23 08:22:40 +01:00
scrub.c	btrfs: allocate scrub workqueues outside of locks	2020-09-09 19:12:31 +02:00
send.c	btrfs: send: emit file capabilities after chown	2020-06-22 09:31:12 +02:00
send.h
space-info.c	btrfs: fix lockdep splat from btrfs_dump_space_info	2020-08-19 08:16:01 +02:00
space-info.h	btrfs: improve global reserve stealing logic	2020-06-22 09:31:08 +02:00
struct-funcs.c	btrfs: tie extent buffer and it's token together	2019-09-09 14:59:16 +02:00
super.c	btrfs: reset compression level for lzo on remount	2020-09-03 11:27:02 +02:00
sysfs.c	btrfs: sysfs: use NOFS for device creation	2020-08-21 13:05:22 +02:00
sysfs.h	btrfs: sysfs: move helper macros to sysfs.c	2019-09-09 14:59:08 +02:00
transaction.c	btrfs: add wrapper for transaction abort predicate	2020-08-26 10:40:49 +02:00
transaction.h	btrfs: add wrapper for transaction abort predicate	2020-08-26 10:40:49 +02:00
tree-checker.c	btrfs: tree-checker: Check leaf chunk item size	2020-10-01 13:17:28 +02:00
tree-checker.h	btrfs: get fs_info from eb in btrfs_check_chunk_valid	2019-04-29 19:02:39 +02:00
tree-defrag.c
tree-log.c	btrfs: check the right error variable in btrfs_del_dir_entries_in_log	2020-09-03 11:27:02 +02:00
tree-log.h	btrfs: get fs_info from trans in btrfs_set_log_full_commit	2019-04-29 19:02:41 +02:00
ulist.c
ulist.h
uuid-tree.c	btrfs: handle ENOENT in btrfs_uuid_tree_iterate	2019-12-31 16:42:05 +01:00
volumes.c	btrfs: fix lockdep splat in add_missing_dev	2020-09-17 13:47:51 +02:00
volumes.h	btrfs: Remove btrfs_bio::flags member	2019-12-17 19:56:06 +01:00
xattr.c	Btrfs: fix failure to persist compression property xattr deletion on fsync	2019-06-17 16:37:17 +02:00
xattr.h	btrfs: cleanup btrfs_setxattr_trans and drop transaction parameter	2019-04-29 19:02:44 +02:00
zlib.c	btrfs: compression: replace set_level callbacks by a common helper	2019-09-09 14:59:11 +02:00
zstd.c	btrfs: move cond_wake_up functions out of ctree	2019-09-09 14:59:15 +02:00