aboutsummaryrefslogtreecommitdiffstats
path: root/drivers/md/raid10.c
diff options
context:
space:
mode:
authorH. Peter Anvin <hpa@zytor.com>2013-06-12 10:37:43 -0400
committerNeilBrown <neilb@suse.de>2013-06-13 00:49:54 -0400
commit5026d7a9b2f3eb1f9bda66c18ac6bc3036ec9020 (patch)
treed87edf6a82c43ec53d43aed19ef6710fac6a1b67 /drivers/md/raid10.c
parente2d59925221cd562e07fee38ec8839f7209ae603 (diff)
md/raid1,5,10: Disable WRITE SAME until a recovery strategy is in place
There are cases where the kernel will believe that the WRITE SAME command is supported by a block device which does not, in fact, support WRITE SAME. This currently happens for SATA drivers behind a SAS controller, but there are probably a hundred other ways that can happen, including drive firmware bugs. After receiving an error for WRITE SAME the block layer will retry the request as a plain write of zeroes, but mdraid will consider the failure as fatal and consider the drive failed. This has the effect that all the mirrors containing a specific set of data are each offlined in very rapid succession resulting in data loss. However, just bouncing the request back up to the block layer isn't ideal either, because the whole initial request-retry sequence should be inside the write bitmap fence, which probably means that md needs to do its own conversion of WRITE SAME to write zero. Until the failure scenario has been sorted out, disable WRITE SAME for raid1, raid5, and raid10. [neilb: added raid5] This patch is appropriate for any -stable since 3.7 when write_same support was added. Cc: stable@vger.kernel.org Signed-off-by: H. Peter Anvin <hpa@linux.intel.com> Signed-off-by: NeilBrown <neilb@suse.de>
Diffstat (limited to 'drivers/md/raid10.c')
-rw-r--r--drivers/md/raid10.c3
1 files changed, 1 insertions, 2 deletions
diff --git a/drivers/md/raid10.c b/drivers/md/raid10.c
index aa9ed304951e..06c2cbe046e2 100644
--- a/drivers/md/raid10.c
+++ b/drivers/md/raid10.c
@@ -3651,8 +3651,7 @@ static int run(struct mddev *mddev)
3651 if (mddev->queue) { 3651 if (mddev->queue) {
3652 blk_queue_max_discard_sectors(mddev->queue, 3652 blk_queue_max_discard_sectors(mddev->queue,
3653 mddev->chunk_sectors); 3653 mddev->chunk_sectors);
3654 blk_queue_max_write_same_sectors(mddev->queue, 3654 blk_queue_max_write_same_sectors(mddev->queue, 0);
3655 mddev->chunk_sectors);
3656 blk_queue_io_min(mddev->queue, chunk_size); 3655 blk_queue_io_min(mddev->queue, chunk_size);
3657 if (conf->geo.raid_disks % conf->geo.near_copies) 3656 if (conf->geo.raid_disks % conf->geo.near_copies)
3658 blk_queue_io_opt(mddev->queue, chunk_size * conf->geo.raid_disks); 3657 blk_queue_io_opt(mddev->queue, chunk_size * conf->geo.raid_disks);