mirror_zfs

mirror of https://git.proxmox.com/git/mirror_zfs.git synced 2026-04-17 08:54:52 +03:00

Author	SHA1	Message	Date
Brian Behlendorf	7a060636b0	Conserve stack in dsl_scan_visitbp() The recursive call chain dsl_scan_visitbp() -> dsl_scan_recurse() -> dsl_scan_visitdnode() -> dsl_scan_visitbp has been observed to consume considerable stack resulting in a stack overflow (>8k). The cleanest way I see to fix this with minimal impact to the existing flow of code, and with the fewest performance concerns, is to always inline dsl_scan_recurse() and dsl_scan_visitdnode(). While this will increase the function size of dsl_scan_visitbp(), by 4660 bytes, it also reduces the stack requirements by removing the function call overhead. Issue #174	2011-04-26 13:37:35 -07:00
Brian Behlendorf	44e9e34793	Merged pull request #212 from dajhorn/hostid. Use gethostid in the Linux convention.	2011-04-26 13:30:27 -07:00
Brian Behlendorf	701b1f8168	Fix zvol deadlock It's possible for a zvol_write thread to enter direct memory reclaim while holding open a transaction group. This results in the system attempting to write out data to the disk to free memory. Unfortunately, this can't succeed because the the thread doing reclaim is holding open the txg which must be closed to be synced to disk. To prevent this the offending allocation is marked KM_PUSHPAGE which will prevent it from attempting writeback. Closes #191	2011-04-26 12:56:35 -07:00
Darik Horn	492b8e9e7b	Use gethostid in the Linux convention. Disable the gethostid() override for Solaris behavior because Linux systems implement the POSIX standard in a way that allows a negative result. Mask the gethostid() result to the lower four bytes, like coreutils does in /usr/bin/hostid, to prevent junk bits or sign-extension on systems that have an eight byte long type. This can cause a spurious hostid mismatch that prevents zpool import on 64-bit systems.	2011-04-25 10:36:17 -05:00
Brian Behlendorf	a1cc0b3290	Fix 32-bit MAXOFFSET_T definition Having MAXOFFSET_T defined to 0x7fffffffl was artificially limiting the maximum file size on 32-bit systems. In reality MAXOFFSET_T is used when working with 'long long' types and as such we now define it as LLONG_MAX. This resolves the 2GB file size limit for files and additionally allows zvols greater than 2GB on 32-bit systems. Closes #136 Closes #81	2011-04-22 16:21:26 -07:00
Brian Behlendorf	e2448b0e62	Fix spurious -EFAULT when setting I/O scheduler Occasionally we would see an -EFAULT returned when setting the I/O scheduler on a vdev. This was caused an improperly formatted user mode helper command. This commit restructures the command to something simpler, allocates space for it dynamically to save stack, and removes the retry logic which is no longer needed. Closes #169	2011-04-22 14:55:35 -07:00
Brian Behlendorf	6a8f9b6bf0	Enforce ARC meta-data limits This change ensures the ARC meta-data limits are enforced. Without this enforcement meta-data can grow to consume all of the ARC cache pushing out data and hurting performance. The cache is aggressively reclaimed but this is a soft and not a hard limit. The cache may exceed the set limit briefly before being brought under control. By default 25% of the ARC capacity can be used for meta-data. This limit can be tuned by setting the 'zfs_arc_meta_limit' module option. Once this limit is exceeded meta-data reclaim will occur in 3 percent chunks, or may be tuned using 'arc_reduce_dnlc_percent'. Closes #193	2011-04-21 13:49:31 -07:00
Gunnar Beutner	36df284366	Fixed a use-after-free bug in zfs_zget(). Fixed a bug where zfs_zget could access a stale znode pointer when the inode had already been removed from the inode cache via iput -> iput_final -> ... -> zfs_zinactive but the corresponding SA handle was still alive. Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes #180	2011-04-21 13:48:01 -07:00
Brian Behlendorf	d247f2a3cc	Suppress 'zfs receive' memory warning As part of zfs_ioc_recv() a zfs_cmd_t is allocated in the kernel which is 17808 bytes in size. This sort of thing in general should be avoided. However, since this should be an infrequent event for now we allow it and simply suppress the warning with the KM_NODEBUG flag. This can be revisited latter if/when it becomes an issue. Closes #178	2011-04-20 10:22:31 -07:00
Aniruddha Shankar	9caef54224	Added required runlevel info for init on Debian Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes #208	2011-04-20 09:55:57 -07:00
Brian Behlendorf	3fce1d0962	Update zconfig.sh to use new zvol names This change should have occured when we commited the new udev rules for zvols. Basically, the test script is just out of date. We need to update it to use the /dev/zvol/ device names, and to expect the more common -partN suffixes. I added a udev_trigger() call in zconfig_partition() and zconfig_zvol_device_stat() to ensure that all the udev rules have run before. This ensures the devices are available to subsequent commands and closes a small race. Finally, I was forced added a small 'sleep 1' to test 10. I was observing occassional failures in my VM due to the device still claiming to be busy. Delaying betwen the various methods of adding/removing a vdev avoids the issue. Closes #207	2011-04-19 16:33:41 -07:00
Brian Behlendorf	18f4238277	Add parted and lsscsi dependencies to zfs-test The zfault.sh and zconfig.sh test scripts requires the parted utility, the lsscsi utility, and the scsi_debug module. To ensure the utilities are available they have been added as dependencies to zfs-test package. Checking for scsi_debug is a little more problematic because if it's missing you will need to build it. For clarity the documention has been updated to mention this. Closes #205 Closes #206	2011-04-19 15:22:46 -07:00
Brian Behlendorf	cbb1e401f9	Add Gunnar Beutner to AUTHORS for his contributions	2011-04-19 14:14:51 -07:00
Gunnar Beutner	bec30953cd	Truncate the xattr znode when updating existing attributes. If the attribute's new value was shorter than the old one the old code would leave parts of the old value in the xattr znode. Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes #203	2011-04-19 14:14:40 -07:00
Gunnar Beutner	274b7e79f3	Added missing initialization for va.va_dentry in zfs_get_xattrdir. Without this we may mistakenly believe we have a dentry and try to d_instantiate() it. This will result in the following BUG. It's important to note that while the xattr directory has an inode assoicated with it we never create a dentry for it. kernel BUG at fs/dcache.c:1418! Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov> Closes #202	2011-04-19 13:57:41 -07:00
Richard Laager	826ab7ad19	Support IEC base-2 prefixes Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 12:56:21 -07:00
Richard Laager	4da4a9e1a7	Cleanup various Sun/Solaris-isms Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:42:42 -07:00
Richard Laager	251eb26d17	Update the version in the zpool upgrade example Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:40:45 -07:00
Richard Laager	3b2041509f	Normalize the deferred destruction language Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:40:26 -07:00
Richard Laager	0d122e21ff	Improve the wording about hot spares Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:39:37 -07:00
Richard Laager	6b92390fe8	Improve some quoting consistency Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:38:56 -07:00
Richard Laager	577468215b	Remove a stray tab Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:38:27 -07:00
Richard Laager	25d4782bac	Linux has "partitions", not "slices" Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:38:04 -07:00
Richard Laager	54e5f2264d	Use Linux disk names in zpool.8 Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:37:31 -07:00
Richard Laager	1dc3fea59e	More and correct an example in zpool.8 Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:36:17 -07:00
Richard Laager	1fe2e23771	Change /dev/dsk -> /dev in the man pages Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:35:21 -07:00
Richard Laager	2d1b7b0b97	Correct man page section numbers for Linux Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-19 11:33:51 -07:00
Brian Behlendorf	12c1acde76	Set -Wno-unused-but-set-variable globally As of gcc-4.6 the option -Wunused-but-set-variable is enabled by default. While this is a useful warning there are numerous places in the ZFS code when a variable is set and then only checked in an ASSERT(). To avoid having to update every instance of this in the code we now set -Wno-unused-but-set-variable to suppress the warning. Additionally, when building with --enable-debug and -Werror set these warning also become fatal. We can reevaluate the suppression of these error at a later time if it becomes an issue. For now we are basically just reverting to the previous gcc behavior.	2011-04-19 10:44:10 -07:00
Brian Behlendorf	79713039a2	Fix gcc configure warnings Newer versions of gcc are getting smart enough to detect the sloppy syntax used for the autoconf tests. It is now generating warnings for unused/undeclared variables. Newer version of gcc even have the -Wunused-but-set-variable option set by default. This isn't a problem except when -Werror is set and they get promoted to an error. In this case the autoconf test will return an incorrect result which will result in a build failure latter on. To handle this I'm tightening up many of the autoconf tests to explicitly mark variables as unused to suppress the gcc warning. Remember, all of the autoconf code can never actually be run we just want to get a clean build error to detect which APIs are available. Never using a variable is absolutely fine for this. Closes #176	2011-04-19 10:10:47 -07:00
Brian Behlendorf	03514b0110	Fix gcc compiler warning, parse_option() When compiling ZFS in user space gcc-4.6.0 correctly identifies the variable 'value' as being set but never used. This generates a warning and a build failure when using --enable-debug. Once again this is correct but I'm reluctant to remove 'value' because we are breaking the string in to name/value pairs. While it is not used now there's a good chance it will be soon and I'd rather not have to reinvent this. To suppress the warning with just as a VERIFY(). This was observed under Fedora 15. cmd/mount_zfs/mount_zfs.c: In function ‘parse_option’: cmd/mount_zfs/mount_zfs.c:112:21: error: variable ‘value’ set but not used [-Werror=unused-but-set-variable]	2011-04-19 09:04:51 -07:00
Brian Behlendorf	0fe3d820f5	Fix gcc compiler warning, dsl_pool_create() When compiling ZFS in user space gcc-4.6.0 correctly identifies the variable 'os' as being set but never used. This generates a warning and a build failure when using --enable-debug. However, the code is correct we only want to use 'os' for the kernel space builds. To suppress the warning the call was wrapped with a VERIFY() which has the nice side effect of ensuring the 'os' actually never is NULL. This was observed under Fedora 15. module/zfs/dsl_pool.c: In function ‘dsl_pool_create’: module/zfs/dsl_pool.c:229:12: error: variable ‘os’ set but not used [-Werror=unused-but-set-variable]	2011-04-19 09:04:51 -07:00
Brian Behlendorf	e30c0ada6d	Linux 2.6.39 compat, invalidate_inodes() Update code to use the spl_invalidate_inodes() wrapper. This hides some of the complexity of determining if invalidate_inodes() was exported, and if so what is its prototype. The second argument of spl_invalidate_inodes() determined the behavior of how dirty inodes are handled. By passing a zero we are indicated that we want those inodes to be treated as busy and skipped.	2011-04-19 08:57:23 -07:00
Brian Behlendorf	cbf81d4c3b	Autogen refresh for kernel-insert-inode-locked.m4 Several Makefile.in's were accidentally not updated when the kernel-insert-inode-locked.m4 check was added. This change simply refreshes the missed files.	2011-04-18 15:47:18 -07:00
Brian Behlendorf	0f6fd32927	Fix rebuildable RPMs for el6/ch5 When rebuilding the source RPM under el5 you need to append the target_cpu. However, under el6/ch5 things are packaged correctly and the arch is already part of kver. For this reason it also needs to be stripped from kver when setting kverpkg.	2011-04-08 10:22:42 -07:00
Ned Bass	38baef530d	Align closing fi in mount-zfs.sh	2011-04-08 10:05:18 -07:00
Ned Bass	c4803a9663	Use consistent indentation in mount-zfs.sh	2011-04-08 09:25:18 -07:00
Richard Laager	f132992361	Fix a couple comments Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-07 15:46:46 -07:00
Brian Behlendorf	0d3ac5e735	Linux 2.6.29 compat, credentials The .sync_fs fix as applied did not use the updated SPL credential API. This broke builds on Debian Lenny, this change applies the needed fix to use the portable API. The original credential changes are part of commit `81e97e2187`.	2011-04-07 14:27:09 -07:00
Brian Behlendorf	9b1d6001bc	Prep zfs-0.6.0-rc3 tag Create the third 0.6.0 release candidate tag (rc3).	2011-04-07 10:49:55 -07:00
Manuel Amador (Rudd-O)	f5ef7150ea	Update zfs.fedora init script Apply all of Rudd-O's changes for the Fedora init script. The initial init script was one I threw together based on Rudd-O's original work. It worked for me but it has some flaws. Rudd-O has invested considerable time updating it to be significantly smarter. It now handles using ZFS as your root filesystem plus various other quirks. Since he is familiar with the right way to do things on Fedora and has tested this init script we are integrating all of his changes. Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-07 10:48:11 -07:00
Manuel Amador (Rudd-O)	6583dcacdc	Permit both mountpoint=legacy and mountpoint=/ in initrd Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-07 10:31:02 -07:00
Manuel Amador (Rudd-O)	8610b52bd4	Added .gitignore for mount.zfs and zvol_id Signed-off-by: Brian Behlendorf <behlendorf1@llnl.gov>	2011-04-07 10:28:38 -07:00
Brian Behlendorf	eec8164771	Fix ASSERTION(!dsl_pool_sync_context(tx->tx_pool)) Disable the normal reclaim path for the txg_sync thread. This ensures the thread will never enter dmu_tx_assign() which can otherwise occur due to direct reclaim. If this is allowed to happen the system can deadlock. Direct reclaim call path: ->shrink_icache_memory->prune_icache->dispose_list-> clear_inode->zpl_clear_inode->zfs_inactive->dmu_tx_assign	2011-04-07 09:52:16 -07:00
Brian Behlendorf	7cb67b45f3	Add direct+indirect ARC reclaim Under OpenSolaris all memory reclaim is done asyncronously. Under Linux memory reclaim is done asynchronously _and_ synchronously. When a process allocates memory with GFP_KERNEL it explicitly allows the kernel to do reclaim on its behalf to satify the allocation. If that GFP_KERNEL allocation fails the kernel may take more drastic measures to reclaim the memory such as killing user space processes. This was observed to happen with ZFS because the ARC could consume a large fraction of the system memory but no synchronous reclaim could be performed on it. The result was GFP_KERNEL allocations could fail resulting in OOM events, and only moments latter the arc_reclaim thread would free unused memory from the ARC. This change leaves the arc_thread in place to manage the fundamental ARC behavior. But it adds a synchronous (direct) reclaim path for the ARC which can be called when memory is badly needed. It also adds an asynchronous (indirect) reclaim path which is called much more frequently to prune the ARC slab caches.	2011-04-07 09:52:10 -07:00
Brian Behlendorf	1834f2d8b7	Add missing arcstats The following useful values were missing the arcstats. This change adds them in to provide greater visibility in to the arcs behavior. arc_no_grow 4 0 arc_tempreserve 4 0 arc_loaned_bytes 4 0 arc_meta_used 4 624774592 arc_meta_limit 4 400785408 arc_meta_max 4 625594176	2011-04-07 09:52:05 -07:00
Brian Behlendorf	c85b224faf	Call d_instantiate before unlocking inode Under Linux a dentry referencing an inode must be instantiated before the inode is unlocked. To accomplish this without overly modifing the core ZFS code the dentry it passed via the vattr_t. There are cases such as replay when a dentry is not available. In which case it is obviously not initialized at inode creation time, if a dentry is needed it will be spliced as when required via d_lookup().	2011-04-07 09:51:57 -07:00
Brian Behlendorf	d433c20651	Fix `make distclean` for `./configure --with-config=user Making distclean in module make[1]: Entering directory `/zfs/module' make -C SUBDIRS=`pwd` clean make: Entering an unknown directory make: *** SUBDIRS=/zfs/module: No such file or directory. Stop. When using --with-config=user the 'distclean' target would fail because it assumes the kernel configuration infrastrure is set up. This is not the case, nor does it need to be, because the '--with-config=user' option will prune the entire ./module subtree from SUBDIRS. This prevents most build rules from operating in the ./module directory. However, the 'dist*' rules will still traverse this directory because it is listed in DIST_SUBDIRS. This is correct because we need to ensure the dist rules package the directory contents regardless of the configuration for the 'dist' rule. The correct way to handle this is to only invoke the kernel build system as part of the 'clean' rule when CONFIG_KERNEL_TRUE is set. Initial fix provided by Darik Horn <dajhorn@vanadac.com>. This commit is a slightly refined form of the original.	2011-04-05 13:33:28 -07:00
Ned Bass	fa417e57a6	Call udevadm trigger more safely Some udev hooks are not designed to be idempotent, so calling udevadm trigger outside of the distribution's initialization scripts can have unexpected (and potentially dangerous) side effects. For example, the system time may change or devices may appear multiple times. See Ubuntu launchpad bug 320200 and this mailing list post for more details: https://lists.ubuntu.com/archives/ubuntu-devel/2009-January/027260.html To avoid these problems we call udevadm trigger with --action=change --subsystem-match=block. The first argument tells udev just to refresh devices, and make sure everything's as it should be. The second argument limits the scope to block devices, so devices belonging to other subsystems cannot be affected. This doesn't fix the problem on older udev implementations that don't provide udevadm but instead have udevtrigger as a standalone program. In this case the above options aren't available so there's no way to call call udevtrigger safely. But we can live with that since this issue only exists in optional test and helper scripts, and most zfs-on-linux users are running newer systems anyways.	2011-04-05 13:00:51 -07:00
Brian Behlendorf	34e4864009	Update CHAOS 5 Packaging The CHAOS 5 kernels are now packaged identially to the RHEL6 kernels. Therefore we can simply use the RHEL6 rules in the spec file when building packages.	2011-03-31 13:49:42 -07:00
Brian Behlendorf	e88b041ed6	Fix libzpool cv_* build error This build failure was accidentally introduced by previous commit `bfd214a` which fixed the load average. Unfortunately, the wrapper for cv_wait_interruptible was not available in the zfs_context.h user compatibility code. I failed to notice this because I didn't rebuild everything cleanly before committing. undefined reference to `cv_wait_interruptible' collect2: ld returned 1 exit status Closes #181	2011-03-31 12:20:53 -07:00

... 7 8 9 10 11 ...

765 Commits