aboutsummaryrefslogtreecommitdiffstats
path: root/drivers
diff options
context:
space:
mode:
authorSrivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com>2014-05-05 03:22:39 -0400
committerRafael J. Wysocki <rafael.j.wysocki@intel.com>2014-05-06 18:32:39 -0400
commitca654dc3a93d3b47dddc0c24a98043060bbb256b (patch)
tree9231abdfcd4a9c51d0d3360fa3c72496b811b88b /drivers
parent6b17ddb2a50b9403c6948ec3e4ea2bd2d7064ff3 (diff)
cpufreq: Catch double invocations of cpufreq_freq_transition_begin/end
Some cpufreq drivers were redundantly invoking the _begin() and _end() APIs around frequency transitions, and this double invocation (one from the cpufreq core and the other from the cpufreq driver) used to result in a self-deadlock, leading to system hangs during boot. (The _begin() API makes contending callers wait until the previous invocation is complete. Hence, the cpufreq driver would end up waiting on itself!). Now all such drivers have been fixed, but debugging this issue was not very straight-forward (even lockdep didn't catch this). So let us add a debug infrastructure to the cpufreq core to catch such issues more easily in the future. We add a new field called 'transition_task' to the policy structure, to keep track of the task which is performing the frequency transition. Using this field, we make note of this task during _begin() and print a warning if we find a case where the same task is calling _begin() again, before completing the previous frequency transition using the corresponding _end(). We have left out ASYNC_NOTIFICATION drivers from this debug infrastructure for 2 reasons: 1. At the moment, we have no way to avoid a particular scenario where this debug infrastructure can emit false-positive warnings for such drivers. The scenario is depicted below: Task A Task B /* 1st freq transition */ Invoke _begin() { ... ... } Change the frequency /* 2nd freq transition */ Invoke _begin() { ... //waiting for B to ... //finish _end() for ... //the 1st transition ... | Got interrupt for successful ... | change of frequency (1st one). ... | ... | /* 1st freq transition */ ... | Invoke _end() { ... | ... ... V } ... ... } This scenario is actually deadlock-free because, once Task A changes the frequency, it is Task B's responsibility to invoke the corresponding _end() for the 1st frequency transition. Hence it is perfectly legal for Task A to go ahead and attempt another frequency transition in the meantime. (Of course it won't be able to proceed until Task B finishes the 1st _end(), but this doesn't cause a deadlock or a hang). The debug infrastructure cannot handle this scenario and will treat it as a deadlock and print a warning. To avoid this, we exclude such drivers from the purview of this code. 2. Luckily, we don't _need_ this infrastructure for ASYNC_NOTIFICATION drivers at all! The cpufreq core does not automatically invoke the _begin() and _end() APIs during frequency transitions in such drivers. Thus, the driver alone is responsible for invoking _begin()/_end() and hence there shouldn't be any conflicts which lead to double invocations. So, we can skip these drivers, since the probability that such drivers will hit this problem is extremely low, as outlined above. Signed-off-by: Srivatsa S. Bhat <srivatsa.bhat@linux.vnet.ibm.com> Acked-by: Viresh Kumar <viresh.kumar@linaro.org> Signed-off-by: Rafael J. Wysocki <rafael.j.wysocki@intel.com>
Diffstat (limited to 'drivers')
-rw-r--r--drivers/cpufreq/cpufreq.c14
1 files changed, 14 insertions, 0 deletions
diff --git a/drivers/cpufreq/cpufreq.c b/drivers/cpufreq/cpufreq.c
index a517da996aaf..bfe82b63875f 100644
--- a/drivers/cpufreq/cpufreq.c
+++ b/drivers/cpufreq/cpufreq.c
@@ -365,6 +365,18 @@ static void cpufreq_notify_post_transition(struct cpufreq_policy *policy,
365void cpufreq_freq_transition_begin(struct cpufreq_policy *policy, 365void cpufreq_freq_transition_begin(struct cpufreq_policy *policy,
366 struct cpufreq_freqs *freqs) 366 struct cpufreq_freqs *freqs)
367{ 367{
368
369 /*
370 * Catch double invocations of _begin() which lead to self-deadlock.
371 * ASYNC_NOTIFICATION drivers are left out because the cpufreq core
372 * doesn't invoke _begin() on their behalf, and hence the chances of
373 * double invocations are very low. Moreover, there are scenarios
374 * where these checks can emit false-positive warnings in these
375 * drivers; so we avoid that by skipping them altogether.
376 */
377 WARN_ON(!(cpufreq_driver->flags & CPUFREQ_ASYNC_NOTIFICATION)
378 && current == policy->transition_task);
379
368wait: 380wait:
369 wait_event(policy->transition_wait, !policy->transition_ongoing); 381 wait_event(policy->transition_wait, !policy->transition_ongoing);
370 382
@@ -376,6 +388,7 @@ wait:
376 } 388 }
377 389
378 policy->transition_ongoing = true; 390 policy->transition_ongoing = true;
391 policy->transition_task = current;
379 392
380 spin_unlock(&policy->transition_lock); 393 spin_unlock(&policy->transition_lock);
381 394
@@ -392,6 +405,7 @@ void cpufreq_freq_transition_end(struct cpufreq_policy *policy,
392 cpufreq_notify_post_transition(policy, freqs, transition_failed); 405 cpufreq_notify_post_transition(policy, freqs, transition_failed);
393 406
394 policy->transition_ongoing = false; 407 policy->transition_ongoing = false;
408 policy->transition_task = NULL;
395 409
396 wake_up(&policy->transition_wait); 410 wake_up(&policy->transition_wait);
397} 411}