author     Tejun Heo <tj@kernel.org>                        2013-08-28 17:33:37 -0400
committer  Greg Kroah-Hartman <gregkh@linuxfoundation.org>  2013-09-08 01:09:58 -0400
commit     6ff96f7340e8a0b7b7e2c40a26bc47fb320e6475 (patch)
tree       a9dfcf85e24f44f8a41fcb6e8c48382d1e24377e /kernel
parent     a0d9a083e450d581b98707bdb6837f55554e673b (diff)
workqueue: cond_resched() after processing each work item
commit b22ce2785d97423846206cceec4efee0c4afd980 upstream.

If !PREEMPT, a kworker running work items back to back can hog the CPU.
This becomes dangerous when a self-requeueing work item which is waiting
for something to happen races against stop_machine.  Such a
self-requeueing work item would requeue itself indefinitely, hogging the
kworker and the CPU it's running on, while stop_machine would wait for
that CPU to enter stop_machine while preventing anything else from
happening on all other CPUs.  The two would deadlock.

Jamie Liu reports that this deadlock scenario exists around
scsi_requeue_run_queue() and libata port multiplier support, where one
port may exclude command processing from other ports.  With the right
timing, scsi_requeue_run_queue() can end up requeueing itself trying to
execute an IO which is asked to be retried while another device has
exclusive access, which in turn can't make forward progress due to
stop_machine.

Fix it by invoking cond_resched() after executing each work item.

Signed-off-by: Tejun Heo <tj@kernel.org>
Reported-by: Jamie Liu <jamieliu@google.com>
References: http://thread.gmane.org/gmane.linux.kernel/1552567
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
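For illustration only (not part of the patch): a minimal sketch of the
self-requeueing pattern the message above describes.  All names here
(example_work, example_ready, example_work_fn, example_init) are
hypothetical.  On a !PREEMPT kernel, a kworker executing such an item
back to back never yields the CPU between items, so stop_machine cannot
trap that CPU and the two can deadlock.

/* Hedged sketch, not from the patch: a self-requeueing work item.
 * The names are hypothetical and exist only to illustrate the pattern.
 */
#include <linux/module.h>
#include <linux/workqueue.h>

static struct work_struct example_work;
static bool example_ready;

static void example_work_fn(struct work_struct *work)
{
	if (!example_ready) {
		/* Condition not met yet: requeue ourselves and retry.
		 * If the condition can only become true after
		 * stop_machine completes, this loop and stop_machine
		 * deadlock on !PREEMPT unless the kworker yields
		 * between work items.
		 */
		queue_work(system_wq, &example_work);
		return;
	}
	/* ... perform the real work once the condition holds ... */
}

static int __init example_init(void)
{
	INIT_WORK(&example_work, example_work_fn);
	queue_work(system_wq, &example_work);
	return 0;
}

The patch below has process_one_work() call cond_resched() after each
work item, giving other tasks, including stop_machine's per-CPU stopper
thread, a chance to run between requeues.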
Diffstat (limited to 'kernel')
-rw-r--r--  kernel/workqueue.c | 9 +++++++++
1 file changed, 9 insertions(+), 0 deletions(-)
diff --git a/kernel/workqueue.c b/kernel/workqueue.c
index 6f019219ddbd..e52d002d3893 100644
--- a/kernel/workqueue.c
+++ b/kernel/workqueue.c
@@ -2188,6 +2188,15 @@ __acquires(&pool->lock)
 		dump_stack();
 	}
 
+	/*
+	 * The following prevents a kworker from hogging CPU on !PREEMPT
+	 * kernels, where a requeueing work item waiting for something to
+	 * happen could deadlock with stop_machine as such work item could
+	 * indefinitely requeue itself while all other CPUs are trapped in
+	 * stop_machine.
+	 */
+	cond_resched();
+
 	spin_lock_irq(&pool->lock);
 
 	/* clear cpu intensive status */