diff options
Diffstat (limited to 'Documentation/workqueue.txt')
-rw-r--r-- | Documentation/workqueue.txt | 73 |
1 files changed, 57 insertions, 16 deletions
diff --git a/Documentation/workqueue.txt b/Documentation/workqueue.txt index e4498a2872c3..a0b577de918f 100644 --- a/Documentation/workqueue.txt +++ b/Documentation/workqueue.txt | |||
@@ -12,6 +12,7 @@ CONTENTS | |||
12 | 4. Application Programming Interface (API) | 12 | 4. Application Programming Interface (API) |
13 | 5. Example Execution Scenarios | 13 | 5. Example Execution Scenarios |
14 | 6. Guidelines | 14 | 6. Guidelines |
15 | 7. Debugging | ||
15 | 16 | ||
16 | 17 | ||
17 | 1. Introduction | 18 | 1. Introduction |
@@ -190,17 +191,17 @@ resources, scheduled and executed. | |||
190 | * Long running CPU intensive workloads which can be better | 191 | * Long running CPU intensive workloads which can be better |
191 | managed by the system scheduler. | 192 | managed by the system scheduler. |
192 | 193 | ||
193 | WQ_FREEZEABLE | 194 | WQ_FREEZABLE |
194 | 195 | ||
195 | A freezeable wq participates in the freeze phase of the system | 196 | A freezable wq participates in the freeze phase of the system |
196 | suspend operations. Work items on the wq are drained and no | 197 | suspend operations. Work items on the wq are drained and no |
197 | new work item starts execution until thawed. | 198 | new work item starts execution until thawed. |
198 | 199 | ||
199 | WQ_RESCUER | 200 | WQ_MEM_RECLAIM |
200 | 201 | ||
201 | All wq which might be used in the memory reclaim paths _MUST_ | 202 | All wq which might be used in the memory reclaim paths _MUST_ |
202 | have this flag set. This reserves one worker exclusively for | 203 | have this flag set. The wq is guaranteed to have at least one |
203 | the execution of this wq under memory pressure. | 204 | execution context regardless of memory pressure. |
204 | 205 | ||
205 | WQ_HIGHPRI | 206 | WQ_HIGHPRI |
206 | 207 | ||
@@ -356,11 +357,11 @@ If q1 has WQ_CPU_INTENSIVE set, | |||
356 | 357 | ||
357 | 6. Guidelines | 358 | 6. Guidelines |
358 | 359 | ||
359 | * Do not forget to use WQ_RESCUER if a wq may process work items which | 360 | * Do not forget to use WQ_MEM_RECLAIM if a wq may process work items |
360 | are used during memory reclaim. Each wq with WQ_RESCUER set has one | 361 | which are used during memory reclaim. Each wq with WQ_MEM_RECLAIM |
361 | rescuer thread reserved for it. If there is dependency among | 362 | set has an execution context reserved for it. If there is |
362 | multiple work items used during memory reclaim, they should be | 363 | dependency among multiple work items used during memory reclaim, |
363 | queued to separate wq each with WQ_RESCUER. | 364 | they should be queued to separate wq each with WQ_MEM_RECLAIM. |
364 | 365 | ||
365 | * Unless strict ordering is required, there is no need to use ST wq. | 366 | * Unless strict ordering is required, there is no need to use ST wq. |
366 | 367 | ||
@@ -368,13 +369,53 @@ If q1 has WQ_CPU_INTENSIVE set, | |||
368 | recommended. In most use cases, concurrency level usually stays | 369 | recommended. In most use cases, concurrency level usually stays |
369 | well under the default limit. | 370 | well under the default limit. |
370 | 371 | ||
371 | * A wq serves as a domain for forward progress guarantee (WQ_RESCUER), | 372 | * A wq serves as a domain for forward progress guarantee |
372 | flush and work item attributes. Work items which are not involved | 373 | (WQ_MEM_RECLAIM, flush and work item attributes. Work items which |
373 | in memory reclaim and don't need to be flushed as a part of a group | 374 | are not involved in memory reclaim and don't need to be flushed as a |
374 | of work items, and don't require any special attribute, can use one | 375 | part of a group of work items, and don't require any special |
375 | of the system wq. There is no difference in execution | 376 | attribute, can use one of the system wq. There is no difference in |
376 | characteristics between using a dedicated wq and a system wq. | 377 | execution characteristics between using a dedicated wq and a system |
378 | wq. | ||
377 | 379 | ||
378 | * Unless work items are expected to consume a huge amount of CPU | 380 | * Unless work items are expected to consume a huge amount of CPU |
379 | cycles, using a bound wq is usually beneficial due to the increased | 381 | cycles, using a bound wq is usually beneficial due to the increased |
380 | level of locality in wq operations and work item execution. | 382 | level of locality in wq operations and work item execution. |
383 | |||
384 | |||
385 | 7. Debugging | ||
386 | |||
387 | Because the work functions are executed by generic worker threads | ||
388 | there are a few tricks needed to shed some light on misbehaving | ||
389 | workqueue users. | ||
390 | |||
391 | Worker threads show up in the process list as: | ||
392 | |||
393 | root 5671 0.0 0.0 0 0 ? S 12:07 0:00 [kworker/0:1] | ||
394 | root 5672 0.0 0.0 0 0 ? S 12:07 0:00 [kworker/1:2] | ||
395 | root 5673 0.0 0.0 0 0 ? S 12:12 0:00 [kworker/0:0] | ||
396 | root 5674 0.0 0.0 0 0 ? S 12:13 0:00 [kworker/1:0] | ||
397 | |||
398 | If kworkers are going crazy (using too much cpu), there are two types | ||
399 | of possible problems: | ||
400 | |||
401 | 1. Something beeing scheduled in rapid succession | ||
402 | 2. A single work item that consumes lots of cpu cycles | ||
403 | |||
404 | The first one can be tracked using tracing: | ||
405 | |||
406 | $ echo workqueue:workqueue_queue_work > /sys/kernel/debug/tracing/set_event | ||
407 | $ cat /sys/kernel/debug/tracing/trace_pipe > out.txt | ||
408 | (wait a few secs) | ||
409 | ^C | ||
410 | |||
411 | If something is busy looping on work queueing, it would be dominating | ||
412 | the output and the offender can be determined with the work item | ||
413 | function. | ||
414 | |||
415 | For the second type of problems it should be possible to just check | ||
416 | the stack trace of the offending worker thread. | ||
417 | |||
418 | $ cat /proc/THE_OFFENDING_KWORKER/stack | ||
419 | |||
420 | The work item's function should be trivially visible in the stack | ||
421 | trace. | ||