author     Harshad Shirwadkar <harshads@google.com>  2018-01-10 00:13:13 -0500
committer  Theodore Ts'o <tytso@mit.edu>  2018-01-10 00:13:13 -0500
commit     abbc3f9395c76d554a9ed27d4b1ebfb5d9b0e4ca
tree       463181185ebf743f7dbee86b6215b20302b59b33
parent     9ee93ba3c430d5b5140ab72738dc70c4c54990e0

ext4: fix a race in the ext4 shutdown path

This patch fixes a race between the shutdown path and bio completion
handling. In the ext4 direct io path with async io, after submitting a
bio to the block layer, if journal starting fails,
ext4_direct_IO_write() would bail out pretending that the IO
failed. The caller would have had no way of knowing whether or not the
IO was successfully submitted. So instead, we return -EIOCBQUEUED in
this case. Now, the caller knows that the IO was submitted. The bio
completion handler takes care of the error.

Tested: Ran the shutdown xfstest test 461 in loop for over 2 hours across
4 machines resulting in over 400 runs. Verified that the race didn't
occur. Usually the race was seen in about 20-30 iterations.

Signed-off-by: Harshad Shirwadkar <harshads@google.com>
Signed-off-by: Theodore Ts'o <tytso@mit.edu>
Cc: stable@vger.kernel.org

-rw-r--r--  fs/ext4/inode.c  16
1 file changed, 12 insertions, 4 deletions

diff --git a/fs/ext4/inode.c b/fs/ext4/inode.c
index 534a9130f625..4c2f8b57bdc7 100644
--- a/fs/ext4/inode.c
+++ b/fs/ext4/inode.c
@@ -3767,10 +3767,18 @@ static ssize_t ext4_direct_IO_write(struct kiocb *iocb, struct iov_iter *iter)
 		/* Credits for sb + inode write */
 		handle = ext4_journal_start(inode, EXT4_HT_INODE, 2);
 		if (IS_ERR(handle)) {
-			/* This is really bad luck. We've written the data
-			 * but cannot extend i_size. Bail out and pretend
-			 * the write failed... */
-			ret = PTR_ERR(handle);
+			/*
+			 * We wrote the data but cannot extend
+			 * i_size. Bail out. In async io case, we do
+			 * not return error here because we have
+			 * already submmitted the corresponding
+			 * bio. Returning error here makes the caller
+			 * think that this IO is done and failed
+			 * resulting in race with bio's completion
+			 * handler.
+			 */
+			if (!ret)
+				ret = PTR_ERR(handle);
 			if (inode->i_nlink)
 				ext4_orphan_del(NULL, inode);
 
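
The effect of the changed lines on the return value can be modelled in userspace. This is a minimal sketch, not kernel code: ret_before_patch(), ret_after_patch() and check_model() are hypothetical helpers, MODEL_EIOCBQUEUED mirrors the kernel-internal EIOCBQUEUED value (529), and -EIO is only an assumed example of the error that a failed ext4_journal_start() might yield.

```c
#include <assert.h>
#include <errno.h>
#include <sys/types.h>

/* Mirrors the kernel-internal EIOCBQUEUED ("iocb queued") for this model. */
#define MODEL_EIOCBQUEUED 529

/*
 * Before the patch: a journal-start failure always overwrote ret,
 * even when ret already said that an async bio had been queued.
 */
static ssize_t ret_before_patch(ssize_t ret, long journal_err)
{
	(void)ret;
	return journal_err;
}

/*
 * After the patch: the journal error is reported only when ret is
 * still 0, so a queued async IO keeps its -EIOCBQUEUED result.
 */
static ssize_t ret_after_patch(ssize_t ret, long journal_err)
{
	if (!ret)
		ret = journal_err;
	return ret;
}

/* Hypothetical self-check walking through the cases. */
static void check_model(void)
{
	/* Async bio queued, then journal start fails: old code hid the queued IO. */
	assert(ret_before_patch(-MODEL_EIOCBQUEUED, -EIO) == -EIO);
	/* New code keeps -EIOCBQUEUED, leaving the error to the completion handler. */
	assert(ret_after_patch(-MODEL_EIOCBQUEUED, -EIO) == -MODEL_EIOCBQUEUED);
	/* With ret == 0 the journal error still surfaces. */
	assert(ret_after_patch(0, -EIO) == -EIO);
}
```

In this model the caller sees a failure code only when no earlier result exists, which is what stops it from treating an in-flight bio as already done and failed.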