diff options
Diffstat (limited to 'Documentation/RCU/stallwarn.txt')
-rw-r--r-- | Documentation/RCU/stallwarn.txt | 41 |
1 files changed, 31 insertions, 10 deletions
diff --git a/Documentation/RCU/stallwarn.txt b/Documentation/RCU/stallwarn.txt index 44c6dcc93d6d..4e959208f736 100644 --- a/Documentation/RCU/stallwarn.txt +++ b/Documentation/RCU/stallwarn.txt | |||
@@ -1,22 +1,25 @@ | |||
1 | Using RCU's CPU Stall Detector | 1 | Using RCU's CPU Stall Detector |
2 | 2 | ||
3 | The CONFIG_RCU_CPU_STALL_DETECTOR kernel config parameter enables | 3 | The rcu_cpu_stall_suppress module parameter enables RCU's CPU stall |
4 | RCU's CPU stall detector, which detects conditions that unduly delay | 4 | detector, which detects conditions that unduly delay RCU grace periods. |
5 | RCU grace periods. The stall detector's idea of what constitutes | 5 | This module parameter enables CPU stall detection by default, but |
6 | "unduly delayed" is controlled by a set of C preprocessor macros: | 6 | may be overridden via boot-time parameter or at runtime via sysfs. |
7 | The stall detector's idea of what constitutes "unduly delayed" is | ||
8 | controlled by a set of kernel configuration variables and cpp macros: | ||
7 | 9 | ||
8 | RCU_SECONDS_TILL_STALL_CHECK | 10 | CONFIG_RCU_CPU_STALL_TIMEOUT |
9 | 11 | ||
10 | This macro defines the period of time that RCU will wait from | 12 | This kernel configuration parameter defines the period of time |
11 | the beginning of a grace period until it issues an RCU CPU | 13 | that RCU will wait from the beginning of a grace period until it |
12 | stall warning. This time period is normally ten seconds. | 14 | issues an RCU CPU stall warning. This time period is normally |
15 | ten seconds. | ||
13 | 16 | ||
14 | RCU_SECONDS_TILL_STALL_RECHECK | 17 | RCU_SECONDS_TILL_STALL_RECHECK |
15 | 18 | ||
16 | This macro defines the period of time that RCU will wait after | 19 | This macro defines the period of time that RCU will wait after |
17 | issuing a stall warning until it issues another stall warning | 20 | issuing a stall warning until it issues another stall warning |
18 | for the same stall. This time period is normally set to thirty | 21 | for the same stall. This time period is normally set to three |
19 | seconds. | 22 | times the check interval plus thirty seconds. |
20 | 23 | ||
21 | RCU_STALL_RAT_DELAY | 24 | RCU_STALL_RAT_DELAY |
22 | 25 | ||
@@ -80,6 +83,24 @@ o A CPU looping with bottom halves disabled. This condition can | |||
80 | o For !CONFIG_PREEMPT kernels, a CPU looping anywhere in the kernel | 83 | o For !CONFIG_PREEMPT kernels, a CPU looping anywhere in the kernel |
81 | without invoking schedule(). | 84 | without invoking schedule(). |
82 | 85 | ||
86 | o A CPU-bound real-time task in a CONFIG_PREEMPT kernel, which might | ||
87 | happen to preempt a low-priority task in the middle of an RCU | ||
88 | read-side critical section. This is especially damaging if | ||
89 | that low-priority task is not permitted to run on any other CPU, | ||
90 | in which case the next RCU grace period can never complete, which | ||
91 | will eventually cause the system to run out of memory and hang. | ||
92 | While the system is in the process of running itself out of | ||
93 | memory, you might see stall-warning messages. | ||
94 | |||
95 | o A CPU-bound real-time task in a CONFIG_PREEMPT_RT kernel that | ||
96 | is running at a higher priority than the RCU softirq threads. | ||
97 | This will prevent RCU callbacks from ever being invoked, | ||
98 | and in a CONFIG_TREE_PREEMPT_RCU kernel will further prevent | ||
99 | RCU grace periods from ever completing. Either way, the | ||
100 | system will eventually run out of memory and hang. In the | ||
101 | CONFIG_TREE_PREEMPT_RCU case, you might see stall-warning | ||
102 | messages. | ||
103 | |||
83 | o A bug in the RCU implementation. | 104 | o A bug in the RCU implementation. |
84 | 105 | ||
85 | o A hardware failure. This is quite unlikely, but has occurred | 106 | o A hardware failure. This is quite unlikely, but has occurred |