mirror_zfs/include/os/linux/spl/sys
Matthew Ahrens 3442c2a02d
Revise ARC shrinker algorithm
The ARC shrinker callback `arc_shrinker_count/_scan()` is invoked by the
kernel's shrinker mechanism when the system is running low on free
pages.  This happens via 2 code paths:

1. "direct reclaim": The system is attempting to allocate a page, but we
are low on memory.  The ARC shrinker callback is invoked from the
page-allocation code path.

2. "indirect reclaim": kswapd notices that there aren't many free pages,
so it invokes the ARC shrinker callback.

In both cases, the kernel's shrinker code requests that the ARC shrinker
callback release some of its cache, and then it measures how many pages
were released.  However, it's measurement of released pages does not
include pages that are freed via `__free_pages()`, which is how the ARC
releases memory (via `abd_free_chunks()`).  Rather, the kernel shrinker
code is looking for pages to be placed on the lists of reclaimable pages
(which is separate from actually-free pages).

Because the kernel shrinker code doesn't detect that the ARC has
released pages, it may call the ARC shrinker callback many times,
resulting in the ARC "collapsing" down to `arc_c_min`.  This has several
negative impacts:

1. ZFS doesn't use RAM to cache data effectively.

2. In the direct reclaim case, a single page allocation may wait a long
time (e.g. more than a minute) while we evict the entire ARC.

3. Even with the improvements made in 67c0f0dedc ("ARC shrinking blocks
reads/writes"), occasionally `arc_size` may stay above `arc_c` for the
entire time of the ARC collapse, thus blocking ZFS read/write operations
in `arc_get_data_impl()`.

To address these issues, this commit limits the ways that the ARC
shrinker callback can be used by the kernel shrinker code, and mitigates
the impact of arc_is_overflowing() on ZFS read/write operations.

With this commit:

1. We limit the amount of data that can be reclaimed from the ARC via
the "direct reclaim" shrinker.  This limits the amount of time it takes
to allocate a single page.

2. We do not allow the ARC to shrink via kswapd (indirect reclaim).
Instead we rely on `arc_evict_zthr` to monitor free memory and reduce
the ARC target size to keep sufficient free memory in the system.  Note
that we can't simply rely on limiting the amount that we reclaim at once
(as for the direct reclaim case), because kswapd's "boosted" logic can
invoke the callback an unlimited number of times (see
`balance_pgdat()`).

3. When `arc_is_overflowing()` and we want to allocate memory,
`arc_get_data_impl()` will wait only for a multiple of the requested
amount of data to be evicted, rather than waiting for the ARC to no
longer be overflowing.  This allows ZFS reads/writes to make progress
even while the ARC is overflowing, while also ensuring that the eviction
thread makes progress towards reducing the total amount of memory used
by the ARC.

4. The amount of memory that the ARC always tries to keep free for the
rest of the system, `arc_sys_free` is increased.

5. Now that the shrinker callback is able to provide feedback to the
kernel's shrinker code about our progress, we can safely enable
the kswapd hook. This will allow the arc to receive notifications
when memory pressure is first detected by the kernel. We also
re-enable the appropriate kstats to track these callbacks.

Reviewed-by: Alexander Motin <mav@FreeBSD.org>
Reviewed-by: Ryan Moeller <ryan@iXsystems.com>
Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Co-authored-by: George Wilson <george.wilson@delphix.com>
Signed-off-by: Matthew Ahrens <mahrens@delphix.com>
Closes #10600
2020-07-31 21:10:52 -07:00
..
acl.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
atomic.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
byteorder.h Prefix zfs internal endian checks with _ZFS 2020-07-28 13:02:49 -07:00
callb.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
callo.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
cmn_err.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
condvar.h Disambiguate condvar API contract 2020-06-18 10:17:50 -07:00
console.h Mark functions as static 2020-06-18 12:20:38 -07:00
cred.h Linux compat: Minimum kernel version 3.10 2019-11-12 08:59:06 -08:00
ctype.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
debug.h Remove stale ASSERTV comment 2019-12-06 09:33:27 -08:00
disp.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
dkio.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
errno.h Move platform dependent errno aliases 2019-10-25 13:40:50 -07:00
fcntl.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
file.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
inttypes.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
isa_defs.h Prefix zfs internal endian checks with _ZFS 2020-07-28 13:02:49 -07:00
kmem_cache.h remove kmem_cache module parameter KMC_EXPIRE_AGE 2020-07-24 09:39:26 -07:00
kmem.h Move GFP flags kernel compatibility code 2020-06-08 16:33:46 -07:00
kstat.h Linux 5.6 compat: struct proc_ops 2020-02-07 11:03:53 -08:00
list.h Fixing gang ABD child removal race condition 2020-07-14 11:04:35 -07:00
Makefile.am Drop unnecessary srcdir paths 2020-06-24 18:20:18 -07:00
mod_os.h Wrap Linux module macros 2019-11-01 10:41:03 -07:00
mutex.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
param.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
proc.h Fix zfs send progress reporting 2020-04-20 10:12:48 -07:00
processor.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
procfs_list.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
random.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
rwlock.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
shrinker.h Revise ARC shrinker algorithm 2020-07-31 21:10:52 -07:00
sid.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
signal.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
simd.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
stat.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
strings.h Linux compat: Minimum kernel version 3.10 2019-11-12 08:59:06 -08:00
sunddi.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
sysmacros.h Move linux qsort def to platform header 2019-12-03 09:49:40 -08:00
systeminfo.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
taskq.h Improve performance of zio_taskq_member 2020-03-03 10:29:38 -08:00
thread.h Introduce names for ZTHRs 2020-07-29 09:43:33 -07:00
time.h Linux 5.6 compat: time_t 2020-02-27 09:31:02 -08:00
timer.h Linux compat: Minimum kernel version 3.10 2019-11-12 08:59:06 -08:00
trace_spl.h Add tracepoints for taskq entry lifetime events 2019-11-01 13:14:54 -07:00
trace_taskq.h Add tracepoints for taskq entry lifetime events 2019-11-01 13:14:54 -07:00
trace.h Add prototypes 2020-06-18 12:21:32 -07:00
tsd.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
types32.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
types.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
uio.h Add convenience wrappers for common uio usage 2020-06-14 10:09:55 -07:00
user.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
vfs.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
vmem.h Clean up OS-specific ARC and kmem code 2020-06-29 09:01:07 -07:00
vmsystm.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
vnode.h Add zfs_file_* interface, remove vnodes 2019-11-21 09:32:57 -08:00
wait.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00
zmod.h Linux compat: Minimum kernel version 3.10 2019-11-12 08:59:06 -08:00
zone.h OpenZFS restructuring - move platform specific headers 2019-09-05 09:34:54 -07:00