diff options
author | David Howells <dhowells@redhat.com> | 2006-04-11 01:54:23 -0400 |
---|---|---|
committer | Linus Torvalds <torvalds@g5.osdl.org> | 2006-04-11 09:18:44 -0400 |
commit | dbc8700e27a94621de9d22c506c67913e0121501 (patch) | |
tree | 4ac767de021735f8826ac849ac8a3795bd78ade5 | |
parent | 235963b2edc080b577000031b9ad75804dd83c88 (diff) |
[PATCH] Fix memory barrier docs wrt atomic ops
Fix the memory barrier documentation to attempt to describe atomic ops
correctly.
atomic_t ops that return a value _do_ imply smp_mb() either side, and so
don't actually require smp_mb__*_atomic_*() special barriers.
Also explains why special barriers exist in addition to normal barriers.
Further fix the memory barrier documents to portray bitwise operation
memory barrier effects correctly following Nick Piggin's comments.
It makes the point that any atomic op that both modifies some state in
memory and returns information on that state implies memory barriers on
both sides.
Signed-off-by: David Howells <dhowells@redhat.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
-rw-r--r-- | Documentation/memory-barriers.txt | 52 |
1 files changed, 33 insertions, 19 deletions
diff --git a/Documentation/memory-barriers.txt b/Documentation/memory-barriers.txt index f8550310a6d5..528d52f52eeb 100644 --- a/Documentation/memory-barriers.txt +++ b/Documentation/memory-barriers.txt | |||
@@ -829,8 +829,8 @@ There are some more advanced barrier functions: | |||
829 | (*) smp_mb__after_atomic_inc(); | 829 | (*) smp_mb__after_atomic_inc(); |
830 | 830 | ||
831 | These are for use with atomic add, subtract, increment and decrement | 831 | These are for use with atomic add, subtract, increment and decrement |
832 | functions, especially when used for reference counting. These functions | 832 | functions that don't return a value, especially when used for reference |
833 | do not imply memory barriers. | 833 | counting. These functions do not imply memory barriers. |
834 | 834 | ||
835 | As an example, consider a piece of code that marks an object as being dead | 835 | As an example, consider a piece of code that marks an object as being dead |
836 | and then decrements the object's reference count: | 836 | and then decrements the object's reference count: |
@@ -1263,15 +1263,17 @@ else. | |||
1263 | ATOMIC OPERATIONS | 1263 | ATOMIC OPERATIONS |
1264 | ----------------- | 1264 | ----------------- |
1265 | 1265 | ||
1266 | Though they are technically interprocessor interaction considerations, atomic | 1266 | Whilst they are technically interprocessor interaction considerations, atomic |
1267 | operations are noted specially as they do _not_ generally imply memory | 1267 | operations are noted specially as some of them imply full memory barriers and |
1268 | barriers. The possible offenders include: | 1268 | some don't, but they're very heavily relied on as a group throughout the |
1269 | kernel. | ||
1270 | |||
1271 | Any atomic operation that modifies some state in memory and returns information | ||
1272 | about the state (old or new) implies an SMP-conditional general memory barrier | ||
1273 | (smp_mb()) on each side of the actual operation. These include: | ||
1269 | 1274 | ||
1270 | xchg(); | 1275 | xchg(); |
1271 | cmpxchg(); | 1276 | cmpxchg(); |
1272 | test_and_set_bit(); | ||
1273 | test_and_clear_bit(); | ||
1274 | test_and_change_bit(); | ||
1275 | atomic_cmpxchg(); | 1277 | atomic_cmpxchg(); |
1276 | atomic_inc_return(); | 1278 | atomic_inc_return(); |
1277 | atomic_dec_return(); | 1279 | atomic_dec_return(); |
@@ -1282,21 +1284,31 @@ barriers. The possible offenders include: | |||
1282 | atomic_sub_and_test(); | 1284 | atomic_sub_and_test(); |
1283 | atomic_add_negative(); | 1285 | atomic_add_negative(); |
1284 | atomic_add_unless(); | 1286 | atomic_add_unless(); |
1287 | test_and_set_bit(); | ||
1288 | test_and_clear_bit(); | ||
1289 | test_and_change_bit(); | ||
1290 | |||
1291 | These are used for such things as implementing LOCK-class and UNLOCK-class | ||
1292 | operations and adjusting reference counters towards object destruction, and as | ||
1293 | such the implicit memory barrier effects are necessary. | ||
1285 | 1294 | ||
1286 | These may be used for such things as implementing LOCK operations or controlling | ||
1287 | the lifetime of objects by decreasing their reference counts. In such cases | ||
1288 | they need preceding memory barriers. | ||
1289 | 1295 | ||
1290 | The following may also be possible offenders as they may be used as UNLOCK | 1296 | The following operation are potential problems as they do _not_ imply memory |
1291 | operations. | 1297 | barriers, but might be used for implementing such things as UNLOCK-class |
1298 | operations: | ||
1292 | 1299 | ||
1300 | atomic_set(); | ||
1293 | set_bit(); | 1301 | set_bit(); |
1294 | clear_bit(); | 1302 | clear_bit(); |
1295 | change_bit(); | 1303 | change_bit(); |
1296 | atomic_set(); | 1304 | |
1305 | With these the appropriate explicit memory barrier should be used if necessary | ||
1306 | (smp_mb__before_clear_bit() for instance). | ||
1297 | 1307 | ||
1298 | 1308 | ||
1299 | The following are a little tricky: | 1309 | The following also do _not_ imply memory barriers, and so may require explicit |
1310 | memory barriers under some circumstances (smp_mb__before_atomic_dec() for | ||
1311 | instance)): | ||
1300 | 1312 | ||
1301 | atomic_add(); | 1313 | atomic_add(); |
1302 | atomic_sub(); | 1314 | atomic_sub(); |
@@ -1317,10 +1329,12 @@ specific order. | |||
1317 | 1329 | ||
1318 | 1330 | ||
1319 | Basically, each usage case has to be carefully considered as to whether memory | 1331 | Basically, each usage case has to be carefully considered as to whether memory |
1320 | barriers are needed or not. The simplest rule is probably: if the atomic | 1332 | barriers are needed or not. |
1321 | operation is protected by a lock, then it does not require a barrier unless | 1333 | |
1322 | there's another operation within the critical section with respect to which an | 1334 | [!] Note that special memory barrier primitives are available for these |
1323 | ordering must be maintained. | 1335 | situations because on some CPUs the atomic instructions used imply full memory |
1336 | barriers, and so barrier instructions are superfluous in conjunction with them, | ||
1337 | and in such cases the special barrier primitives will be no-ops. | ||
1324 | 1338 | ||
1325 | See Documentation/atomic_ops.txt for more information. | 1339 | See Documentation/atomic_ops.txt for more information. |
1326 | 1340 | ||