draid: allow seq resilver reads from degraded vdevs

When sequentially resilvering allow a dRAID child to be read
as long as the DTLs indicate it should have a good copy of the
data and the leaf isn't being rebuilt.  The previous check was
slightly too broad and would skip dRAID spare and replacing
vdevs if one of their children was being replaced.  As long
as there exists enough additional redundancy this is fine, but
when there isn't this vdev must be read in order to correctly
reconstruct the missing data.

A new test case has been added which exhausts the available
redundancy, faults another device causing it to be degraded,
and then performs a sequential resilver for the degraded device.
In such a situation enough redundancy exists to perform the
replacement and a scrub should detect no checksum errors.

Reviewed-by: Alexander Motin <alexander.motin@TrueNAS.com>
Reviewed-by: Andriy Tkachuk <andriy.tkachuk@seagate.com>
Reviewed-by: Akash B <akash-b@hpe.com>
Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>
Closes #18405
This commit is contained in:
Brian Behlendorf
2026-04-07 10:48:27 -07:00
committed by Tony Hutter
parent 63b8da8ff7
commit e9a8c6e080
6 changed files with 162 additions and 35 deletions
-2
View File
@@ -252,8 +252,6 @@ maybe = {
'projectquota/setup': ['SKIP', exec_reason],
'raidz/raidz_002_pos': ['FAIL', known_reason],
'raidz/raidz_expand_001_pos': ['FAIL', 16421],
'redundancy/redundancy_draid_spare1': ['FAIL', 18307],
'redundancy/redundancy_draid_spare3': ['FAIL', 18319],
'removal/removal_condense_export': ['FAIL', known_reason],
'renameat2/setup': ['SKIP', renameat2_reason],
'reservation/reservation_008_pos': ['FAIL', 7741],