Files
mirror_zfs/module/zfs
Matthew Ahrens ba67d82142 Improve zfs receive performance with lightweight write
The performance of `zfs receive` can be bottlenecked on the CPU consumed
by the `receive_writer` thread, especially when receiving streams with
small compressed block sizes.  Much of the CPU is spent creating and
destroying dbuf's and arc buf's, one for each `WRITE` record in the send
stream.

This commit introduces the concept of "lightweight writes", which allows
`zfs receive` to write to the DMU by providing an ABD, and instantiating
only a new type of `dbuf_dirty_record_t`.  The dbuf and arc buf for this
"dirty leaf block" are not instantiated.

Because there is no dbuf with the dirty data, this mechanism doesn't
support reading from "lightweight-dirty" blocks (they would see the
on-disk state rather than the dirty data).  Since the dedup-receive code
has been removed, `zfs receive` is write-only, so this works fine.

Because there are no arc bufs for the received data, the received data
is no longer cached in the ARC.

Testing a receive of a stream with average compressed block size of 4KB,
this commit improves performance by 50%, while also reducing CPU usage
by 50% of a CPU.  On a per-block basis, CPU consumed by receive_writer()
and dbuf_evict() is now 1/7th (14%) of what it was.

Baseline: 450MB/s, CPU in receive_writer() 40% + dbuf_evict() 35%
New: 670MB/s, CPU in receive_writer() 17% + dbuf_evict() 0%

The code is also restructured in a few ways:

Added a `dr_dnode` field to the dbuf_dirty_record_t.  This simplifies
some existing code that no longer needs `DB_DNODE_ENTER()` and related
routines.  The new field is needed by the lightweight-type dirty record.

To ensure that the `dr_dnode` field remains valid until the dirty record
is freed, we have to ensure that the `dnode_move()` doesn't relocate the
dnode_t.  To do this we keep a hold on the dnode until it's zio's have
completed.  This is already done by the user-accounting code
(`userquota_updates_task()`), this commit extends that so that it always
keeps the dnode hold until zio completion (see `dnode_rele_task()`).

`dn_dirty_txg` was previously zeroed when the dnode was synced.  This
was not necessary, since its meaning can be "when was this dnode last
dirtied".  This change simplifies the new `dnode_rele_task()` code.

Removed some dead code related to `DRR_WRITE_BYREF` (dedup receive).

Reviewed-by: Brian Behlendorf <behlendorf1@llnl.gov>
Reviewed-by: Paul Dagnelie <pcd@delphix.com>
Reviewed-by: George Wilson <gwilson@delphix.com>
Signed-off-by: Matthew Ahrens <mahrens@delphix.com>
Closes #11105
2020-12-11 10:26:02 -08:00
..
2020-11-13 13:51:51 -08:00
2020-12-10 14:09:23 -08:00
2020-12-10 14:09:23 -08:00
2020-08-20 10:30:06 -07:00
2019-07-26 10:54:14 -07:00
2020-07-29 16:35:33 -07:00
2020-07-29 16:35:33 -07:00
2019-06-19 09:48:12 -07:00
2020-08-17 15:25:37 -07:00
2020-06-18 12:21:18 -07:00
2019-12-03 09:51:44 -08:00
2020-11-02 11:51:12 -08:00
2020-11-10 10:37:10 -08:00
2020-11-13 13:51:51 -08:00
2020-09-04 10:29:39 -07:00
2020-06-18 12:21:25 -07:00
2020-09-04 10:34:28 -07:00
2020-06-18 12:21:25 -07:00
2017-10-11 16:54:48 -04:00
2020-06-18 12:21:25 -07:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-12-10 14:09:23 -08:00
2019-06-19 09:48:12 -07:00
2020-07-29 16:35:33 -07:00
2020-07-29 16:35:33 -07:00
2020-08-26 21:41:09 -07:00
2020-06-18 12:21:25 -07:00
2020-06-18 12:21:25 -07:00
2020-06-18 12:21:25 -07:00
2019-09-02 17:56:41 -07:00
2020-11-13 13:51:51 -08:00
2020-12-10 14:09:23 -08:00
2020-12-10 14:09:23 -08:00
2019-10-09 10:36:03 -07:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-11-20 13:14:45 -08:00
2020-07-29 16:35:33 -07:00
2020-06-18 12:21:25 -07:00
2019-09-02 17:56:41 -07:00
2020-10-02 17:44:10 -07:00
2020-06-18 12:20:38 -07:00
2020-11-13 13:51:51 -08:00
2020-10-02 17:44:10 -07:00
2020-11-24 09:28:42 -08:00
2020-06-18 12:20:38 -07:00
2020-11-13 13:51:51 -08:00
2020-11-13 13:51:51 -08:00
2020-06-18 12:21:25 -07:00
2020-06-18 12:21:18 -07:00
2020-09-03 20:09:52 -07:00
2020-11-17 09:50:52 -08:00