Fixing gang ABD child removal race condition

On linux the list debug code has been setting off a failure when
checking that the node->next->prev value is pointing back at the node.
At times this check evaluates to 0xdead. When removing a child from a
gang ABD we must acquire the child's abd_mtx to make sure that the
same ABD is not being added to another gang ABD while it is being
removed from a gang ABD. This fixes a race condition when checking
if an ABDs link is already active and part of another gang ABD before
adding it to a gang.

Added additional debug code for the gang ABD in abd_verify() to make
sure each child ABD has active links. Also check to make sure another
gang ABD is not added to a gang ABD.

Reviewed-by: Serapheim Dimitropoulos <serapheim@delphix.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Matt Ahrens <matt@delphix.com>
Signed-off-by: Brian Atkinson <batkinson@lanl.gov>
Closes #10511
This commit is contained in:
Brian Atkinson
2020-07-14 12:04:35 -06:00
committed by GitHub
parent c15d36c674
commit e4d3d77684
4 changed files with 20 additions and 4 deletions
+1
View File
@@ -232,6 +232,7 @@ list_link_init(list_node_t *ln)
int
list_link_active(list_node_t *ln)
{
EQUIV(ln->next == NULL, ln->prev == NULL);
return (ln->next != NULL);
}