author | Linus Torvalds <torvalds@linux-foundation.org> | 2013-11-14 02:30:30 -0500 |
---|---|---|
committer | Linus Torvalds <torvalds@linux-foundation.org> | 2013-11-14 02:30:30 -0500 |
commit | 5e30025a319910695f5010dc0fb53a23299da14d (patch) | |
tree | 4292bcf78de221c7de1774ccf5ad0ac5a9315c26 /Documentation | |
parent | 7971e23a66c94f1b9bd2d64a3e86dfbfa8c60121 (diff) | |
parent | 90d3839b90fe379557dae4a44735a6af78f42885 (diff) |
Merge branch 'core-locking-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip
Pull core locking changes from Ingo Molnar:
"The biggest changes:
- add lockdep support for seqcount/seqlocks structures, this
unearthed both bugs and required extra annotation.
- move the various kernel locking primitives to the new
kernel/locking/ directory"
* 'core-locking-for-linus' of git://git.kernel.org/pub/scm/linux/kernel/git/tip/tip: (21 commits)
block: Use u64_stats_init() to initialize seqcounts
locking/lockdep: Mark __lockdep_count_forward_deps() as static
lockdep/proc: Fix lock-time avg computation
locking/doc: Update references to kernel/mutex.c
ipv6: Fix possible ipv6 seqlock deadlock
cpuset: Fix potential deadlock w/ set_mems_allowed
seqcount: Add lockdep functionality to seqcount/seqlock structures
net: Explicitly initialize u64_stats_sync structures for lockdep
locking: Move the percpu-rwsem code to kernel/locking/
locking: Move the lglocks code to kernel/locking/
locking: Move the rwsem code to kernel/locking/
locking: Move the rtmutex code to kernel/locking/
locking: Move the semaphore core to kernel/locking/
locking: Move the spinlock code to kernel/locking/
locking: Move the lockdep code to kernel/locking/
locking: Move the mutex code to kernel/locking/
hung_task debugging: Add tracepoint to report the hang
x86/locking/kconfig: Update paravirt spinlock Kconfig description
lockstat: Report avg wait and hold times
lockdep, x86/alternatives: Drop ancient lockdep fixup message
...
Diffstat (limited to 'Documentation')
-rw-r--r-- | Documentation/DocBook/kernel-locking.tmpl | 2 | ||||
-rw-r--r-- | Documentation/lockstat.txt | 123 | ||||
-rw-r--r-- | Documentation/mutex-design.txt | 10 |
3 files changed, 67 insertions, 68 deletions
diff --git a/Documentation/DocBook/kernel-locking.tmpl b/Documentation/DocBook/kernel-locking.tmpl
index 09e884e5b9f5..19f2a5a5a5b4 100644
--- a/Documentation/DocBook/kernel-locking.tmpl
+++ b/Documentation/DocBook/kernel-locking.tmpl
@@ -1958,7 +1958,7 @@ machines due to caching. | |||
1958 | <chapter id="apiref-mutex"> | 1958 | <chapter id="apiref-mutex"> |
1959 | <title>Mutex API reference</title> | 1959 | <title>Mutex API reference</title> |
1960 | !Iinclude/linux/mutex.h | 1960 | !Iinclude/linux/mutex.h |
1961 | !Ekernel/mutex.c | 1961 | !Ekernel/locking/mutex.c |
1962 | </chapter> | 1962 | </chapter> |
1963 | 1963 | ||
1964 | <chapter id="apiref-futex"> | 1964 | <chapter id="apiref-futex"> |
diff --git a/Documentation/lockstat.txt b/Documentation/lockstat.txt
index dd2f7b26ca30..72d010689751 100644
--- a/Documentation/lockstat.txt
+++ b/Documentation/lockstat.txt
@@ -46,16 +46,14 @@ With these hooks we provide the following statistics: | |||
46 | contentions - number of lock acquisitions that had to wait | 46 | contentions - number of lock acquisitions that had to wait |
47 | wait time min - shortest (non-0) time we ever had to wait for a lock | 47 | wait time min - shortest (non-0) time we ever had to wait for a lock |
48 | max - longest time we ever had to wait for a lock | 48 | max - longest time we ever had to wait for a lock |
49 | total - total time we spend waiting on this lock | 49 | total - total time we spend waiting on this lock |
50 | avg - average time spent waiting on this lock | ||
50 | acq-bounces - number of lock acquisitions that involved x-cpu data | 51 | acq-bounces - number of lock acquisitions that involved x-cpu data |
51 | acquisitions - number of times we took the lock | 52 | acquisitions - number of times we took the lock |
52 | hold time min - shortest (non-0) time we ever held the lock | 53 | hold time min - shortest (non-0) time we ever held the lock |
53 | max - longest time we ever held the lock | 54 | max - longest time we ever held the lock |
54 | total - total time this lock was held | 55 | total - total time this lock was held |
55 | 56 | avg - average time this lock was held | |
56 | From these number various other statistics can be derived, such as: | ||
57 | |||
58 | hold time average = hold time total / acquisitions | ||
59 | 57 | ||
60 | These numbers are gathered per lock class, per read/write state (when | 58 | These numbers are gathered per lock class, per read/write state (when |
61 | applicable). | 59 | applicable). |
@@ -84,37 +82,38 @@ Look at the current lock statistics: | |||
84 | 82 | ||
85 | # less /proc/lock_stat | 83 | # less /proc/lock_stat |
86 | 84 | ||
87 | 01 lock_stat version 0.3 | 85 | 01 lock_stat version 0.4 |
88 | 02 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | 86 | 02----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
89 | 03 class name con-bounces contentions waittime-min waittime-max waittime-total acq-bounces acquisitions holdtime-min holdtime-max holdtime-total | 87 | 03 class name con-bounces contentions waittime-min waittime-max waittime-total waittime-avg acq-bounces acquisitions holdtime-min holdtime-max holdtime-total holdtime-avg |
90 | 04 ----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- | 88 | 04----------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------------- |
91 | 05 | 89 | 05 |
92 | 06 &mm->mmap_sem-W: 233 538 18446744073708 22924.27 607243.51 1342 45806 1.71 8595.89 1180582.34 | 90 | 06 &mm->mmap_sem-W: 46 84 0.26 939.10 16371.53 194.90 47291 2922365 0.16 2220301.69 17464026916.32 5975.99 |
93 | 07 &mm->mmap_sem-R: 205 587 18446744073708 28403.36 731975.00 1940 412426 0.58 187825.45 6307502.88 | 91 | 07 &mm->mmap_sem-R: 37 100 1.31 299502.61 325629.52 3256.30 212344 34316685 0.10 7744.91 95016910.20 2.77 |
94 | 08 --------------- | 92 | 08 --------------- |
95 | 09 &mm->mmap_sem 487 [<ffffffff8053491f>] do_page_fault+0x466/0x928 | 93 | 09 &mm->mmap_sem 1 [<ffffffff811502a7>] khugepaged_scan_mm_slot+0x57/0x280 |
96 | 10 &mm->mmap_sem 179 [<ffffffff802a6200>] sys_mprotect+0xcd/0x21d | 94 | 19 &mm->mmap_sem 96 [<ffffffff815351c4>] __do_page_fault+0x1d4/0x510 |
97 | 11 &mm->mmap_sem 279 [<ffffffff80210a57>] sys_mmap+0x75/0xce | 95 | 11 &mm->mmap_sem 34 [<ffffffff81113d77>] vm_mmap_pgoff+0x87/0xd0 |
98 | 12 &mm->mmap_sem 76 [<ffffffff802a490b>] sys_munmap+0x32/0x59 | 96 | 12 &mm->mmap_sem 17 [<ffffffff81127e71>] vm_munmap+0x41/0x80 |
99 | 13 --------------- | 97 | 13 --------------- |
100 | 14 &mm->mmap_sem 270 [<ffffffff80210a57>] sys_mmap+0x75/0xce | 98 | 14 &mm->mmap_sem 1 [<ffffffff81046fda>] dup_mmap+0x2a/0x3f0 |
101 | 15 &mm->mmap_sem 431 [<ffffffff8053491f>] do_page_fault+0x466/0x928 | 99 | 15 &mm->mmap_sem 60 [<ffffffff81129e29>] SyS_mprotect+0xe9/0x250 |
102 | 16 &mm->mmap_sem 138 [<ffffffff802a490b>] sys_munmap+0x32/0x59 | 100 | 16 &mm->mmap_sem 41 [<ffffffff815351c4>] __do_page_fault+0x1d4/0x510 |
103 | 17 &mm->mmap_sem 145 [<ffffffff802a6200>] sys_mprotect+0xcd/0x21d | 101 | 17 &mm->mmap_sem 68 [<ffffffff81113d77>] vm_mmap_pgoff+0x87/0xd0 |
104 | 18 | 102 | 18 |
105 | 19 ............................................................................................................................................................................................... | 103 | 19............................................................................................................................................................................................................................. |
106 | 20 | 104 | 20 |
107 | 21 dcache_lock: 621 623 0.52 118.26 1053.02 6745 91930 0.29 316.29 118423.41 | 105 | 21 unix_table_lock: 110 112 0.21 49.24 163.91 1.46 21094 66312 0.12 624.42 31589.81 0.48 |
108 | 22 ----------- | 106 | 22 --------------- |
109 | 23 dcache_lock 179 [<ffffffff80378274>] _atomic_dec_and_lock+0x34/0x54 | 107 | 23 unix_table_lock 45 [<ffffffff8150ad8e>] unix_create1+0x16e/0x1b0 |
110 | 24 dcache_lock 113 [<ffffffff802cc17b>] d_alloc+0x19a/0x1eb | 108 | 24 unix_table_lock 47 [<ffffffff8150b111>] unix_release_sock+0x31/0x250 |
111 | 25 dcache_lock 99 [<ffffffff802ca0dc>] d_rehash+0x1b/0x44 | 109 | 25 unix_table_lock 15 [<ffffffff8150ca37>] unix_find_other+0x117/0x230 |
112 | 26 dcache_lock 104 [<ffffffff802cbca0>] d_instantiate+0x36/0x8a | 110 | 26 unix_table_lock 5 [<ffffffff8150a09f>] unix_autobind+0x11f/0x1b0 |
113 | 27 ----------- | 111 | 27 --------------- |
114 | 28 dcache_lock 192 [<ffffffff80378274>] _atomic_dec_and_lock+0x34/0x54 | 112 | 28 unix_table_lock 39 [<ffffffff8150b111>] unix_release_sock+0x31/0x250 |
115 | 29 dcache_lock 98 [<ffffffff802ca0dc>] d_rehash+0x1b/0x44 | 113 | 29 unix_table_lock 49 [<ffffffff8150ad8e>] unix_create1+0x16e/0x1b0 |
116 | 30 dcache_lock 72 [<ffffffff802cc17b>] d_alloc+0x19a/0x1eb | 114 | 30 unix_table_lock 20 [<ffffffff8150ca37>] unix_find_other+0x117/0x230 |
117 | 31 dcache_lock 112 [<ffffffff802cbca0>] d_instantiate+0x36/0x8a | 115 | 31 unix_table_lock 4 [<ffffffff8150a09f>] unix_autobind+0x11f/0x1b0 |
116 | |||
118 | 117 | ||
119 | This excerpt shows the first two lock class statistics. Line 01 shows the | 118 | This excerpt shows the first two lock class statistics. Line 01 shows the |
120 | output version - each time the format changes this will be updated. Line 02-04 | 119 | output version - each time the format changes this will be updated. Line 02-04 |
@@ -131,30 +130,30 @@ The integer part of the time values is in us. | |||
131 | 130 | ||
132 | Dealing with nested locks, subclasses may appear: | 131 | Dealing with nested locks, subclasses may appear: |
133 | 132 | ||
134 | 32............................................................................................................................................................................................... | 133 | 32........................................................................................................................................................................................................................... |
135 | 33 | 134 | 33 |
136 | 34 &rq->lock: 13128 13128 0.43 190.53 103881.26 97454 3453404 0.00 401.11 13224683.11 | 135 | 34 &rq->lock: 13128 13128 0.43 190.53 103881.26 7.91 97454 3453404 0.00 401.11 13224683.11 3.82 |
137 | 35 --------- | 136 | 35 --------- |
138 | 36 &rq->lock 645 [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75 | 137 | 36 &rq->lock 645 [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75 |
139 | 37 &rq->lock 297 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a | 138 | 37 &rq->lock 297 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a |
140 | 38 &rq->lock 360 [<ffffffff8103c4c5>] select_task_rq_fair+0x1f0/0x74a | 139 | 38 &rq->lock 360 [<ffffffff8103c4c5>] select_task_rq_fair+0x1f0/0x74a |
141 | 39 &rq->lock 428 [<ffffffff81045f98>] scheduler_tick+0x46/0x1fb | 140 | 39 &rq->lock 428 [<ffffffff81045f98>] scheduler_tick+0x46/0x1fb |
142 | 40 --------- | 141 | 40 --------- |
143 | 41 &rq->lock 77 [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75 | 142 | 41 &rq->lock 77 [<ffffffff8103bfc4>] task_rq_lock+0x43/0x75 |
144 | 42 &rq->lock 174 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a | 143 | 42 &rq->lock 174 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a |
145 | 43 &rq->lock 4715 [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54 | 144 | 43 &rq->lock 4715 [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54 |
146 | 44 &rq->lock 893 [<ffffffff81340524>] schedule+0x157/0x7b8 | 145 | 44 &rq->lock 893 [<ffffffff81340524>] schedule+0x157/0x7b8 |
147 | 45 | 146 | 45 |
148 | 46............................................................................................................................................................................................... | 147 | 46........................................................................................................................................................................................................................... |
149 | 47 | 148 | 47 |
150 | 48 &rq->lock/1: 11526 11488 0.33 388.73 136294.31 21461 38404 0.00 37.93 109388.53 | 149 | 48 &rq->lock/1: 1526 11488 0.33 388.73 136294.31 11.86 21461 38404 0.00 37.93 109388.53 2.84 |
151 | 49 ----------- | 150 | 49 ----------- |
152 | 50 &rq->lock/1 11526 [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54 | 151 | 50 &rq->lock/1 11526 [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54 |
153 | 51 ----------- | 152 | 51 ----------- |
154 | 52 &rq->lock/1 5645 [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54 | 153 | 52 &rq->lock/1 5645 [<ffffffff8103ed4b>] double_rq_lock+0x42/0x54 |
155 | 53 &rq->lock/1 1224 [<ffffffff81340524>] schedule+0x157/0x7b8 | 154 | 53 &rq->lock/1 1224 [<ffffffff81340524>] schedule+0x157/0x7b8 |
156 | 54 &rq->lock/1 4336 [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54 | 155 | 54 &rq->lock/1 4336 [<ffffffff8103ed58>] double_rq_lock+0x4f/0x54 |
157 | 55 &rq->lock/1 181 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a | 156 | 55 &rq->lock/1 181 [<ffffffff8104ba65>] try_to_wake_up+0x127/0x25a |
158 | 157 | ||
159 | Line 48 shows statistics for the second subclass (/1) of &rq->lock class | 158 | Line 48 shows statistics for the second subclass (/1) of &rq->lock class |
160 | (subclass starts from 0), since in this case, as line 50 suggests, | 159 | (subclass starts from 0), since in this case, as line 50 suggests, |
@@ -163,16 +162,16 @@ double_rq_lock actually acquires a nested lock of two spinlocks. | |||
163 | View the top contending locks: | 162 | View the top contending locks: |
164 | 163 | ||
165 | # grep : /proc/lock_stat | head | 164 | # grep : /proc/lock_stat | head |
166 | &inode->i_data.tree_lock-W: 15 21657 0.18 1093295.30 11547131054.85 58 10415 0.16 87.51 6387.60 | 165 | clockevents_lock: 2926159 2947636 0.15 46882.81 1784540466.34 605.41 3381345 3879161 0.00 2260.97 53178395.68 13.71 |
167 | &inode->i_data.tree_lock-R: 0 0 0.00 0.00 0.00 23302 231198 0.25 8.45 98023.38 | 166 | tick_broadcast_lock: 346460 346717 0.18 2257.43 39364622.71 113.54 3642919 4242696 0.00 2263.79 49173646.60 11.59 |
168 | dcache_lock: 1037 1161 0.38 45.32 774.51 6611 243371 0.15 306.48 77387.24 | 167 | &mapping->i_mmap_mutex: 203896 203899 3.36 645530.05 31767507988.39 155800.21 3361776 8893984 0.17 2254.15 14110121.02 1.59 |
169 | &inode->i_mutex: 161 286 18446744073709 62882.54 1244614.55 3653 20598 18446744073709 62318.60 1693822.74 | 168 | &rq->lock: 135014 136909 0.18 606.09 842160.68 6.15 1540728 10436146 0.00 728.72 17606683.41 1.69 |
170 | &zone->lru_lock: 94 94 0.53 7.33 92.10 4366 32690 0.29 59.81 16350.06 | 169 | &(&zone->lru_lock)->rlock: 93000 94934 0.16 59.18 188253.78 1.98 1199912 3809894 0.15 391.40 3559518.81 0.93 |
171 | &inode->i_data.i_mmap_mutex: 79 79 0.40 3.77 53.03 11779 87755 0.28 116.93 29898.44 | 170 | tasklist_lock-W: 40667 41130 0.23 1189.42 428980.51 10.43 270278 510106 0.16 653.51 3939674.91 7.72 |
172 | &q->__queue_lock: 48 50 0.52 31.62 86.31 774 13131 0.17 113.08 12277.52 | 171 | tasklist_lock-R: 21298 21305 0.20 1310.05 215511.12 10.12 186204 241258 0.14 1162.33 1179779.23 4.89 |
173 | &rq->rq_lock_key: 43 47 0.74 68.50 170.63 3706 33929 0.22 107.99 17460.62 | 172 | rcu_node_1: 47656 49022 0.16 635.41 193616.41 3.95 844888 1865423 0.00 764.26 1656226.96 0.89 |
174 | &rq->rq_lock_key#2: 39 46 0.75 6.68 49.03 2979 32292 0.17 125.17 17137.63 | 173 | &(&dentry->d_lockref.lock)->rlock: 39791 40179 0.15 1302.08 88851.96 2.21 2790851 12527025 0.10 1910.75 3379714.27 0.27 |
175 | tasklist_lock-W: 15 15 1.45 10.87 32.70 1201 7390 0.58 62.55 13648.47 | 174 | rcu_node_0: 29203 30064 0.16 786.55 1555573.00 51.74 88963 244254 0.00 398.87 428872.51 1.76 |
176 | 175 | ||
177 | Clear the statistics: | 176 | Clear the statistics: |
178 | 177 | ||
diff --git a/Documentation/mutex-design.txt b/Documentation/mutex-design.txt
index 38c10fd7f411..1dfe62c3641d 100644
--- a/Documentation/mutex-design.txt
+++ b/Documentation/mutex-design.txt
@@ -116,11 +116,11 @@ using mutexes at the moment, please let me know if you find any. ] | |||
116 | Implementation of mutexes | 116 | Implementation of mutexes |
117 | ------------------------- | 117 | ------------------------- |
118 | 118 | ||
119 | 'struct mutex' is the new mutex type, defined in include/linux/mutex.h | 119 | 'struct mutex' is the new mutex type, defined in include/linux/mutex.h and |
120 | and implemented in kernel/mutex.c. It is a counter-based mutex with a | 120 | implemented in kernel/locking/mutex.c. It is a counter-based mutex with a |
121 | spinlock and a wait-list. The counter has 3 states: 1 for "unlocked", | 121 | spinlock and a wait-list. The counter has 3 states: 1 for "unlocked", 0 for |
122 | 0 for "locked" and negative numbers (usually -1) for "locked, potential | 122 | "locked" and negative numbers (usually -1) for "locked, potential waiters |
123 | waiters queued". | 123 | queued". |
124 | 124 | ||
125 | the APIs of 'struct mutex' have been streamlined: | 125 | the APIs of 'struct mutex' have been streamlined: |
126 | 126 | ||