rcu: 1Q2010 update for RCU documentation

Add expedited functions. Review documentation and update obsolete verbiage.
Also fix the advice for the RCU CPU-stall kernel configuration parameter,
and document RCU CPU-stall warnings.

Signed-off-by: Paul E. McKenney <paulmck@linux.vnet.ibm.com>
Cc: laijs@cn.fujitsu.com
Cc: dipankar@in.ibm.com
Cc: mathieu.desnoyers@polymtl.ca
Cc: josh@joshtriplett.org
Cc: dvhltc@us.ibm.com
Cc: niv@us.ibm.com
Cc: peterz@infradead.org
Cc: rostedt@goodmis.org
Cc: Valdis.Kletnieks@vt.edu
Cc: dhowells@redhat.com
LKML-Reference: <12635142581866-git-send-email->
Signed-off-by: Ingo Molnar <mingo@elte.hu>
author: Paul E. McKenney <paulmck@linux.vnet.ibm.com> 2010-01-14 19:10:57 -0500
committer: Ingo Molnar <mingo@elte.hu> 2010-01-16 04:25:22 -0500
commit: 4c54005ca438a8b46dd542b497d4f0dc2ca375e8 (patch)
tree: 4274fb9dcbd94480b93fecefcf83969db53461ba /Documentation/RCU/checklist.txt
parent: b6407e863934965cdc66cbc244d811ceeb6f4d77 (diff)
1 files changed, 119 insertions, 81 deletions
diff --git a/Documentation/RCU/checklist.txt b/Documentation/RCU/checklist.txt
index 51525a30e8b4..767cf06a4276 100644
--- a/Documentation/RCU/checklist.txt
+++ b/Documentation/RCU/checklist.txt
@@ -8,13 +8,12 @@ would cause.  This list is based on experiences reviewing such patches
 over a rather long period of time, but improvements are always welcome!
 0.      Is RCU being applied to a read-mostly situation?  If the data
-        structure is updated more than about 10% of the time, then
+        structure is updated more than about 10% of the time, then you
-        you should strongly consider some other approach, unless
+        should strongly consider some other approach, unless detailed
-        detailed performance measurements show that RCU is nonetheless
+        performance measurements show that RCU is nonetheless the right
-        the right tool for the job.  Yes, you might think of RCU
+        tool for the job.  Yes, RCU does reduce read-side overhead by
-        as simply cutting overhead off of the readers and imposing it
+        increasing write-side overhead, which is exactly why normal uses
-        on the writers.  That is exactly why normal uses of RCU will
+        of RCU will do much more reading than updating.
-        do much more reading than updating.
        Another exception is where performance is not an issue, and RCU
        provides a simpler implementation.  An example of this situation
@@ -35,13 +34,13 @@ over a rather long period of time, but improvements are always welcome!
        If you choose #b, be prepared to describe how you have handled
        memory barriers on weakly ordered machines (pretty much all of
-        them -- even x86 allows reads to be reordered), and be prepared
+        them -- even x86 allows later loads to be reordered to precede
-        to explain why this added complexity is worthwhile.  If you
+        earlier stores), and be prepared to explain why this added
-        choose #c, be prepared to explain how this single task does not
+        complexity is worthwhile.  If you choose #c, be prepared to
-        become a major bottleneck on big multiprocessor machines (for
+        explain how this single task does not become a major bottleneck on
-        example, if the task is updating information relating to itself
+        big multiprocessor machines (for example, if the task is updating
-        that other tasks can read, there by definition can be no
+        information relating to itself that other tasks can read, there
-        bottleneck).
+        by definition can be no bottleneck).
 2.      Do the RCU read-side critical sections make proper use of
        rcu_read_lock() and friends?  These primitives are needed
@@ -51,8 +50,10 @@ over a rather long period of time, but improvements are always welcome!
        actuarial risk of your kernel.
        As a rough rule of thumb, any dereference of an RCU-protected
-        pointer must be covered by rcu_read_lock() or rcu_read_lock_bh()
+        pointer must be covered by rcu_read_lock(), rcu_read_lock_bh(),
-        or by the appropriate update-side lock.
+        rcu_read_lock_sched(), or by the appropriate update-side lock.
+        Disabling of preemption can serve as rcu_read_lock_sched(), but
+        is less readable.
 3.      Does the update code tolerate concurrent accesses?
@@ -62,25 +63,27 @@ over a rather long period of time, but improvements are always welcome!
        of ways to handle this concurrency, depending on the situation:
        a.      Use the RCU variants of the list and hlist update
-                primitives to add, remove, and replace elements on an
+                primitives to add, remove, and replace elements on
-                RCU-protected list.  Alternatively, use the RCU-protected
+                an RCU-protected list.  Alternatively, use the other
-                trees that have been added to the Linux kernel.
+                RCU-protected data structures that have been added to
+                the Linux kernel.
                This is almost always the best approach.
        b.      Proceed as in (a) above, but also maintain per-element
                locks (that are acquired by both readers and writers)
                that guard per-element state.  Of course, fields that
-                the readers refrain from accessing can be guarded by the
+                the readers refrain from accessing can be guarded by
-                update-side lock.
+                some other lock acquired only by updaters, if desired.
                This works quite well, also.
        c.      Make updates appear atomic to readers.  For example,
-                pointer updates to properly aligned fields will appear
+                pointer updates to properly aligned fields will
-                atomic, as will individual atomic primitives.  Operations
+                appear atomic, as will individual atomic primitives.
-                performed under a lock and sequences of multiple atomic
+                Sequences of operations performed under a lock will -not-
-                primitives will -not- appear to be atomic.
+                appear to be atomic to RCU readers, nor will sequences
+                of multiple atomic primitives.
                This can work, but is starting to get a bit tricky.
@@ -98,9 +101,9 @@ over a rather long period of time, but improvements are always welcome!
                a new structure containing updated values.
 4.      Weakly ordered CPUs pose special challenges.  Almost all CPUs
-        are weakly ordered -- even i386 CPUs allow reads to be reordered.
+        are weakly ordered -- even x86 CPUs allow later loads to be
-        RCU code must take all of the following measures to prevent
+        reordered to precede earlier stores.  RCU code must take all of
-        memory-corruption problems:
+        the following measures to prevent memory-corruption problems:
        a.      Readers must maintain proper ordering of their memory
                accesses.  The rcu_dereference() primitive ensures that
@@ -113,14 +116,21 @@ over a rather long period of time, but improvements are always welcome!
                The rcu_dereference() primitive is also an excellent
                documentation aid, letting the person reading the code
                know exactly which pointers are protected by RCU.
+                Please note that compilers can also reorder code, and
-                The rcu_dereference() primitive is used by the various
+                they are becoming increasingly aggressive about doing
-                "_rcu()" list-traversal primitives, such as the
+                just that.  The rcu_dereference() primitive therefore
-                list_for_each_entry_rcu().  Note that it is perfectly
+                also prevents destructive compiler optimizations.
-                legal (if redundant) for update-side code to use
-                rcu_dereference() and the "_rcu()" list-traversal
+                The rcu_dereference() primitive is used by the
-                primitives.  This is particularly useful in code
+                various "_rcu()" list-traversal primitives, such
-                that is common to readers and updaters.
+                as the list_for_each_entry_rcu().  Note that it is
+                perfectly legal (if redundant) for update-side code to
+                use rcu_dereference() and the "_rcu()" list-traversal
+                primitives.  This is particularly useful in code that
+                is common to readers and updaters.  However, neither
+                rcu_dereference() nor the "_rcu()" list-traversal
+                primitives can substitute for a good concurrency design
+                coordinating among multiple updaters.
        b.      If the list macros are being used, the list_add_tail_rcu()
                and list_add_rcu() primitives must be used in order
@@ -135,11 +145,14 @@ over a rather long period of time, but improvements are always welcome!
                readers.  Similarly, if the hlist macros are being used,
                the hlist_del_rcu() primitive is required.
-                The list_replace_rcu() primitive may be used to
+                The list_replace_rcu() and hlist_replace_rcu() primitives
-                replace an old structure with a new one in an
+                may be used to replace an old structure with a new one
-                RCU-protected list.
+                in their respective types of RCU-protected lists.
+        d.      Rules similar to (4b) and (4c) apply to the "hlist_nulls"
+                type of RCU-protected linked lists.
-        d.      Updates must ensure that initialization of a given
+        e.      Updates must ensure that initialization of a given
                structure happens before pointers to that structure are
                publicized.  Use the rcu_assign_pointer() primitive
                when publicizing a pointer to a structure that can
@@ -151,16 +164,31 @@ over a rather long period of time, but improvements are always welcome!
        it cannot block.
 6.      Since synchronize_rcu() can block, it cannot be called from
-        any sort of irq context.  Ditto for synchronize_sched() and
+        any sort of irq context.  The same rule applies for
-        synchronize_srcu().
+        synchronize_rcu_bh(), synchronize_sched(), synchronize_srcu(),
+        synchronize_rcu_expedited(), synchronize_rcu_bh_expedited(),
-7.      If the updater uses call_rcu(), then the corresponding readers
+        synchronize_sched_expedited(), and synchronize_srcu_expedited().
-        must use rcu_read_lock() and rcu_read_unlock().  If the updater
-        uses call_rcu_bh(), then the corresponding readers must use
+        The expedited forms of these primitives have the same semantics
-        rcu_read_lock_bh() and rcu_read_unlock_bh().  If the updater
+        as the non-expedited forms, but expediting is both expensive
-        uses call_rcu_sched(), then the corresponding readers must
+        and unfriendly to real-time workloads.  Use of the expedited
-        disable preemption.  Mixing things up will result in confusion
+        primitives should be restricted to rare configuration-change
-        and broken kernels.
+        operations that would not normally be undertaken while a real-time
+        workload is running.
+7.      If the updater uses call_rcu() or synchronize_rcu(), then the
+        corresponding readers must use rcu_read_lock() and
+        rcu_read_unlock().  If the updater uses call_rcu_bh() or
+        synchronize_rcu_bh(), then the corresponding readers must
+        use rcu_read_lock_bh() and rcu_read_unlock_bh().  If the
+        updater uses call_rcu_sched() or synchronize_sched(), then
+        the corresponding readers must disable preemption, possibly
+        by calling rcu_read_lock_sched() and rcu_read_unlock_sched().
+        If the updater uses synchronize_srcu(), then the corresponding
+        readers must use srcu_read_lock() and srcu_read_unlock(),
+        and with the same srcu_struct.  The rules for the expedited
+        primitives are the same as for their non-expedited counterparts.
+        Mixing things up will result in confusion and broken kernels.
        One exception to this rule: rcu_read_lock() and rcu_read_unlock()
        may be substituted for rcu_read_lock_bh() and rcu_read_unlock_bh()
@@ -212,6 +240,8 @@ over a rather long period of time, but improvements are always welcome!
        e.      Periodically invoke synchronize_rcu(), permitting a limited
                number of updates per grace period.
+        The same cautions apply to call_rcu_bh() and call_rcu_sched().
 9.      All RCU list-traversal primitives, which include
        rcu_dereference(), list_for_each_entry_rcu(),
        list_for_each_continue_rcu(), and list_for_each_safe_rcu(),
@@ -229,7 +259,8 @@ over a rather long period of time, but improvements are always welcome!
 10.     Conversely, if you are in an RCU read-side critical section,
        and you don't hold the appropriate update-side lock, you -must-
        use the "_rcu()" variants of the list macros.  Failing to do so
-        will break Alpha and confuse people reading your code.
+        will break Alpha, cause aggressive compilers to generate bad code,
+        and confuse people trying to read your code.
 11.     Note that synchronize_rcu() -only- guarantees to wait until
        all currently executing rcu_read_lock()-protected RCU read-side
@@ -239,15 +270,21 @@ over a rather long period of time, but improvements are always welcome!
        rcu_read_lock()-protected read-side critical sections, do -not-
        use synchronize_rcu().
-        If you want to wait for some of these other things, you might
+        Similarly, disabling preemption is not an acceptable substitute
-        instead need to use synchronize_irq() or synchronize_sched().
+        for rcu_read_lock().  Code that attempts to use preemption
+        disabling where it should be using rcu_read_lock() will break
+        in real-time kernel builds.
+        If you want to wait for interrupt handlers, NMI handlers, and
+        code under the influence of preempt_disable(), you instead
+        need to use synchronize_irq() or synchronize_sched().
 12.     Any lock acquired by an RCU callback must be acquired elsewhere
        with softirq disabled, e.g., via spin_lock_irqsave(),
        spin_lock_bh(), etc.  Failing to disable irq on a given
-        acquisition of that lock will result in deadlock as soon as the
+        acquisition of that lock will result in deadlock as soon as
-        RCU callback happens to interrupt that acquisition's critical
+        the RCU softirq handler happens to run your RCU callback while
-        section.
+        interrupting that acquisition's critical section.
 13.     RCU callbacks can be and are executed in parallel.  In many cases,
        the callback code simply wrappers around kfree(), so that this
@@ -265,29 +302,30 @@ over a rather long period of time, but improvements are always welcome!
        not the case, a self-spawning RCU callback would prevent the
        victim CPU from ever going offline.)
-14.     SRCU (srcu_read_lock(), srcu_read_unlock(), and synchronize_srcu())
+14.     SRCU (srcu_read_lock(), srcu_read_unlock(), synchronize_srcu(),
-        may only be invoked from process context.  Unlike other forms of
+        and synchronize_srcu_expedited()) may only be invoked from
-        RCU, it -is- permissible to block in an SRCU read-side critical
+        process context.  Unlike other forms of RCU, it -is- permissible
-        section (demarked by srcu_read_lock() and srcu_read_unlock()),
+        to block in an SRCU read-side critical section (demarked by
-        hence the "SRCU": "sleepable RCU".  Please note that if you
+        srcu_read_lock() and srcu_read_unlock()), hence the "SRCU":
-        don't need to sleep in read-side critical sections, you should
+        "sleepable RCU".  Please note that if you don't need to sleep
-        be using RCU rather than SRCU, because RCU is almost always
+        in read-side critical sections, you should be using RCU rather
-        faster and easier to use than is SRCU.
+        than SRCU, because RCU is almost always faster and easier to
+        use than is SRCU.
        Also unlike other forms of RCU, explicit initialization
        and cleanup is required via init_srcu_struct() and
        cleanup_srcu_struct().  These are passed a "struct srcu_struct"
        that defines the scope of a given SRCU domain.  Once initialized,
        the srcu_struct is passed to srcu_read_lock(), srcu_read_unlock()
-        and synchronize_srcu().  A given synchronize_srcu() waits only
+        synchronize_srcu(), and synchronize_srcu_expedited().  A given
-        for SRCU read-side critical sections governed by srcu_read_lock()
+        synchronize_srcu() waits only for SRCU read-side critical
-        and srcu_read_unlock() calls that have been passd the same
+        sections governed by srcu_read_lock() and srcu_read_unlock()
-        srcu_struct.  This property is what makes sleeping read-side
+        calls that have been passed the same srcu_struct.  This property
-        critical sections tolerable -- a given subsystem delays only
+        is what makes sleeping read-side critical sections tolerable --
-        its own updates, not those of other subsystems using SRCU.
+        a given subsystem delays only its own updates, not those of other
-        Therefore, SRCU is less prone to OOM the system than RCU would
+        subsystems using SRCU.  Therefore, SRCU is less prone to OOM the
-        be if RCU's read-side critical sections were permitted to
+        system than RCU would be if RCU's read-side critical sections
-        sleep.
+        were permitted to sleep.
        The ability to sleep in read-side critical sections does not
        come for free.  First, corresponding srcu_read_lock() and
@@ -311,12 +349,12 @@ over a rather long period of time, but improvements are always welcome!
        destructive operation, and -only- -then- invoke call_rcu(),
        synchronize_rcu(), or friends.
-        Because these primitives only wait for pre-existing readers,
+        Because these primitives only wait for pre-existing readers, it
-        it is the caller's responsibility to guarantee safety to
+        is the caller's responsibility to guarantee that any subsequent
-        any subsequent readers.
+        readers will execute safely.
-16.     The various RCU read-side primitives do -not- contain memory
+16.     The various RCU read-side primitives do -not- necessarily contain
-        barriers.  The CPU (and in some cases, the compiler) is free
+        memory barriers.  You should therefore plan for the CPU
-        to reorder code into and out of RCU read-side critical sections.
+        and the compiler to freely reorder code into and out of RCU
-        It is the responsibility of the RCU update-side primitives to
+        read-side critical sections.  It is the responsibility of the
-        deal with this.
+        RCU update-side primitives to deal with this.
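
Items 3 and 4e of the checklist describe updates that allocate a new structure, initialize it fully, and only then publish the pointer. The following is a minimal userspace sketch of that ordering, not kernel code. It assumes C11 atomics as a stand-in: the release exchange plays the role that rcu_assign_pointer() plays in the kernel, and the acquire load plays the role of rcu_dereference(). The names struct config, current_config, config_update() and config_read_threshold() are hypothetical and exist only for illustration.

```c
#include <assert.h>
#include <stdatomic.h>
#include <stdlib.h>

/* Hypothetical RCU-style protected structure (illustration only). */
struct config {
	int threshold;
	int limit;
};

/* Shared pointer that readers load and updaters replace. */
static _Atomic(struct config *) current_config;

/*
 * Updater: build a complete new structure, then publish it.
 * The acq_rel exchange orders the initialization before the pointer
 * becomes visible (the job of rcu_assign_pointer() in the kernel).
 * On success returns 0 and hands the previous structure back through
 * old_out; the caller must not free it until no reader can still be
 * using it (in the kernel: after synchronize_rcu(), or via call_rcu()).
 * Returns -1 if allocation fails, leaving the published pointer as is.
 */
static int config_update(int threshold, int limit, struct config **old_out)
{
	struct config *new_cfg;

	assert(old_out != NULL);

	new_cfg = malloc(sizeof(*new_cfg));
	if (new_cfg == NULL)
		return -1;

	/* Initialize every field before publication. */
	new_cfg->threshold = threshold;
	new_cfg->limit = limit;

	*old_out = atomic_exchange_explicit(&current_config, new_cfg,
					    memory_order_acq_rel);
	return 0;
}

/*
 * Reader: load the pointer once, then dereference that local copy.
 * In the kernel this would run between rcu_read_lock() and
 * rcu_read_unlock() and use rcu_dereference() for the load.
 * Returns -1 if nothing has been published yet.
 */
static int config_read_threshold(void)
{
	struct config *cfg;

	cfg = atomic_load_explicit(&current_config, memory_order_acquire);
	return cfg != NULL ? cfg->threshold : -1;
}
```

A caller would invoke config_update(10, 20, &old) and then read the value back with config_read_threshold(). The sketch deliberately leaves out grace-period handling: in a single-threaded test the old structure can be freed immediately, but with concurrent readers the free must wait, which is exactly what item 15 of the checklist warns about.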
diff --git a/Documentation/RCU/checklist.txt b/Documentation/RCU/checklist.txt index 51525a30e8b4..767cf06a4276 100644 --- a/Documentation/RCU/checklist.txt +++ b/Documentation/RCU/checklist.txt
@@ -8,13 +8,12 @@ would cause. This list is based on experiences reviewing such patches
8	over a rather long period of time, but improvements are always welcome!	8	over a rather long period of time, but improvements are always welcome!
9		9
10	0. Is RCU being applied to a read-mostly situation? If the data	10	0. Is RCU being applied to a read-mostly situation? If the data
11	structure is updated more than about 10% of the time, then	11	structure is updated more than about 10% of the time, then you
12	you should strongly consider some other approach, unless	12	should strongly consider some other approach, unless detailed
13	detailed performance measurements show that RCU is nonetheless	13	performance measurements show that RCU is nonetheless the right
14	the right tool for the job. Yes, you might think of RCU	14	tool for the job. Yes, RCU does reduce read-side overhead by
15	as simply cutting overhead off of the readers and imposing it	15	increasing write-side overhead, which is exactly why normal uses
16	on the writers. That is exactly why normal uses of RCU will	16	of RCU will do much more reading than updating.
17	do much more reading than updating.
18		17
19	Another exception is where performance is not an issue, and RCU	18	Another exception is where performance is not an issue, and RCU
20	provides a simpler implementation. An example of this situation	19	provides a simpler implementation. An example of this situation
@@ -35,13 +34,13 @@ over a rather long period of time, but improvements are always welcome!
35		34
36	If you choose #b, be prepared to describe how you have handled	35	If you choose #b, be prepared to describe how you have handled
37	memory barriers on weakly ordered machines (pretty much all of	36	memory barriers on weakly ordered machines (pretty much all of
38	them -- even x86 allows reads to be reordered), and be prepared	37	them -- even x86 allows later loads to be reordered to precede
39	to explain why this added complexity is worthwhile. If you	38	earlier stores), and be prepared to explain why this added
40	choose #c, be prepared to explain how this single task does not	39	complexity is worthwhile. If you choose #c, be prepared to
41	become a major bottleneck on big multiprocessor machines (for	40	explain how this single task does not become a major bottleneck on
42	example, if the task is updating information relating to itself	41	big multiprocessor machines (for example, if the task is updating
43	that other tasks can read, there by definition can be no	42	information relating to itself that other tasks can read, there
44	bottleneck).	43	by definition can be no bottleneck).
45		44
46	2. Do the RCU read-side critical sections make proper use of	45	2. Do the RCU read-side critical sections make proper use of
47	rcu_read_lock() and friends? These primitives are needed	46	rcu_read_lock() and friends? These primitives are needed
@@ -51,8 +50,10 @@ over a rather long period of time, but improvements are always welcome!
51	actuarial risk of your kernel.	50	actuarial risk of your kernel.
52		51
53	As a rough rule of thumb, any dereference of an RCU-protected	52	As a rough rule of thumb, any dereference of an RCU-protected
54	pointer must be covered by rcu_read_lock() or rcu_read_lock_bh()	53	pointer must be covered by rcu_read_lock(), rcu_read_lock_bh(),
55	or by the appropriate update-side lock.	54	rcu_read_lock_sched(), or by the appropriate update-side lock.
		55	Disabling of preemption can serve as rcu_read_lock_sched(), but
		56	is less readable.
56		57
57	3. Does the update code tolerate concurrent accesses?	58	3. Does the update code tolerate concurrent accesses?
58		59
@@ -62,25 +63,27 @@ over a rather long period of time, but improvements are always welcome!
62	of ways to handle this concurrency, depending on the situation:	63	of ways to handle this concurrency, depending on the situation:
63		64
64	a. Use the RCU variants of the list and hlist update	65	a. Use the RCU variants of the list and hlist update
65	primitives to add, remove, and replace elements on an	66	primitives to add, remove, and replace elements on
66	RCU-protected list. Alternatively, use the RCU-protected	67	an RCU-protected list. Alternatively, use the other
67	trees that have been added to the Linux kernel.	68	RCU-protected data structures that have been added to
		69	the Linux kernel.
68		70
69	This is almost always the best approach.	71	This is almost always the best approach.
70		72
71	b. Proceed as in (a) above, but also maintain per-element	73	b. Proceed as in (a) above, but also maintain per-element
72	locks (that are acquired by both readers and writers)	74	locks (that are acquired by both readers and writers)
73	that guard per-element state. Of course, fields that	75	that guard per-element state. Of course, fields that
74	the readers refrain from accessing can be guarded by the	76	the readers refrain from accessing can be guarded by
75	update-side lock.	77	some other lock acquired only by updaters, if desired.
76		78
77	This works quite well, also.	79	This works quite well, also.
78		80
79	c. Make updates appear atomic to readers. For example,	81	c. Make updates appear atomic to readers. For example,
80	pointer updates to properly aligned fields will appear	82	pointer updates to properly aligned fields will
81	atomic, as will individual atomic primitives. Operations	83	appear atomic, as will individual atomic primitives.
82	performed under a lock and sequences of multiple atomic	84	Sequences of perations performed under a lock will -not-
83	primitives will -not- appear to be atomic.	85	appear to be atomic to RCU readers, nor will sequences
		86	of multiple atomic primitives.
84		87
85	This can work, but is starting to get a bit tricky.	88	This can work, but is starting to get a bit tricky.
86		89
@@ -98,9 +101,9 @@ over a rather long period of time, but improvements are always welcome!
98	a new structure containing updated values.	101	a new structure containing updated values.
99		102
100	4. Weakly ordered CPUs pose special challenges. Almost all CPUs	103	4. Weakly ordered CPUs pose special challenges. Almost all CPUs
101	are weakly ordered -- even i386 CPUs allow reads to be reordered.	104	are weakly ordered -- even x86 CPUs allow later loads to be
102	RCU code must take all of the following measures to prevent	105	reordered to precede earlier stores. RCU code must take all of
103	memory-corruption problems:	106	the following measures to prevent memory-corruption problems:
104		107
105	a. Readers must maintain proper ordering of their memory	108	a. Readers must maintain proper ordering of their memory
106	accesses. The rcu_dereference() primitive ensures that	109	accesses. The rcu_dereference() primitive ensures that
@@ -113,14 +116,21 @@ over a rather long period of time, but improvements are always welcome!
113	The rcu_dereference() primitive is also an excellent	116	The rcu_dereference() primitive is also an excellent
114	documentation aid, letting the person reading the code	117	documentation aid, letting the person reading the code
115	know exactly which pointers are protected by RCU.	118	know exactly which pointers are protected by RCU.
116		119	Please note that compilers can also reorder code, and
117	The rcu_dereference() primitive is used by the various	120	they are becoming increasingly aggressive about doing
118	"_rcu()" list-traversal primitives, such as the	121	just that. The rcu_dereference() primitive therefore
119	list_for_each_entry_rcu(). Note that it is perfectly	122	also prevents destructive compiler optimizations.
120	legal (if redundant) for update-side code to use	123
121	rcu_dereference() and the "_rcu()" list-traversal	124	The rcu_dereference() primitive is used by the
122	primitives. This is particularly useful in code	125	various "_rcu()" list-traversal primitives, such
123	that is common to readers and updaters.	126	as the list_for_each_entry_rcu(). Note that it is
		127	perfectly legal (if redundant) for update-side code to
		128	use rcu_dereference() and the "_rcu()" list-traversal
		129	primitives. This is particularly useful in code that
		130	is common to readers and updaters. However, neither
		131	rcu_dereference() nor the "_rcu()" list-traversal
		132	primitives can substitute for a good concurrency design
		133	coordinating among multiple updaters.
124		134
125	b. If the list macros are being used, the list_add_tail_rcu()	135	b. If the list macros are being used, the list_add_tail_rcu()
126	and list_add_rcu() primitives must be used in order	136	and list_add_rcu() primitives must be used in order
@@ -135,11 +145,14 @@ over a rather long period of time, but improvements are always welcome!
135	readers. Similarly, if the hlist macros are being used,	145	readers. Similarly, if the hlist macros are being used,
136	the hlist_del_rcu() primitive is required.	146	the hlist_del_rcu() primitive is required.
137		147
138	The list_replace_rcu() primitive may be used to	148	The list_replace_rcu() and hlist_replace_rcu() primitives
139	replace an old structure with a new one in an	149	may be used to replace an old structure with a new one
140	RCU-protected list.	150	in their respective types of RCU-protected lists.
		151
		152	d. Rules similar to (4b) and (4c) apply to the "hlist_nulls"
		153	type of RCU-protected linked lists.
141		154
142	d. Updates must ensure that initialization of a given	155	e. Updates must ensure that initialization of a given
143	structure happens before pointers to that structure are	156	structure happens before pointers to that structure are
144	publicized. Use the rcu_assign_pointer() primitive	157	publicized. Use the rcu_assign_pointer() primitive
145	when publicizing a pointer to a structure that can	158	when publicizing a pointer to a structure that can
@@ -151,16 +164,31 @@ over a rather long period of time, but improvements are always welcome!
151	it cannot block.	164	it cannot block.
152		165
153	6. Since synchronize_rcu() can block, it cannot be called from	166	6. Since synchronize_rcu() can block, it cannot be called from
154	any sort of irq context. Ditto for synchronize_sched() and	167	any sort of irq context. The same rule applies for
155	synchronize_srcu().	168	synchronize_rcu_bh(), synchronize_sched(), synchronize_srcu(),
156		169	synchronize_rcu_expedited(), synchronize_rcu_bh_expedited(),
157	7. If the updater uses call_rcu(), then the corresponding readers	170	synchronize_sched_expedite(), and synchronize_srcu_expedited().
158	must use rcu_read_lock() and rcu_read_unlock(). If the updater	171
159	uses call_rcu_bh(), then the corresponding readers must use	172	The expedited forms of these primitives have the same semantics
160	rcu_read_lock_bh() and rcu_read_unlock_bh(). If the updater	173	as the non-expedited forms, but expediting is both expensive
161	uses call_rcu_sched(), then the corresponding readers must	174	and unfriendly to real-time workloads. Use of the expedited
162	disable preemption. Mixing things up will result in confusion	175	primitives should be restricted to rare configuration-change
163	and broken kernels.	176	operations that would not normally be undertaken while a real-time
		177	workload is running.
		178
		179	7. If the updater uses call_rcu() or synchronize_rcu(), then the
		180	corresponding readers must use rcu_read_lock() and
		181	rcu_read_unlock(). If the updater uses call_rcu_bh() or
		182	synchronize_rcu_bh(), then the corresponding readers must
		183	use rcu_read_lock_bh() and rcu_read_unlock_bh(). If the
		184	updater uses call_rcu_sched() or synchronize_sched(), then
		185	the corresponding readers must disable preemption, possibly
		186	by calling rcu_read_lock_sched() and rcu_read_unlock_sched().
		187	If the updater uses synchronize_srcu(), the the corresponding
		188	readers must use srcu_read_lock() and srcu_read_unlock(),
		189	and with the same srcu_struct. The rules for the expedited
		190	primitives are the same as for their non-expedited counterparts.
		191	Mixing things up will result in confusion and broken kernels.
164		192
165	One exception to this rule: rcu_read_lock() and rcu_read_unlock()	193	One exception to this rule: rcu_read_lock() and rcu_read_unlock()
166	may be substituted for rcu_read_lock_bh() and rcu_read_unlock_bh()	194	may be substituted for rcu_read_lock_bh() and rcu_read_unlock_bh()
@@ -212,6 +240,8 @@ over a rather long period of time, but improvements are always welcome!
212	e. Periodically invoke synchronize_rcu(), permitting a limited	240	e. Periodically invoke synchronize_rcu(), permitting a limited
213	number of updates per grace period.	241	number of updates per grace period.
214		242
		243	The same cautions apply to call_rcu_bh() and call_rcu_sched().
		244
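Option (e) above, waiting for a grace period after a bounded number of updates, might be sketched as follows. This is a userspace model, not kernel code: struct update_throttle, toy_synchronize_rcu(), and throttled_update() are hypothetical names, and the stub only counts how often a real updater would have called synchronize_rcu().

```c
/* Userspace sketch only; every name here is a hypothetical stand-in. */
struct update_throttle {
	unsigned int limit;         /* updates permitted per grace period */
	unsigned int since_sync;    /* updates since the last grace-period wait */
	unsigned int grace_periods; /* number of grace-period waits so far */
};

/* Stand-in for synchronize_rcu(): records the wait and resets the
 * per-grace-period update count. */
static void toy_synchronize_rcu(struct update_throttle *t)
{
	t->grace_periods++;
	t->since_sync = 0;
}

/* Call once per update that queues old data for deferred freeing. */
static void throttled_update(struct update_throttle *t)
{
	/* ... perform the update and hand the old data to call_rcu() ... */
	if (++t->since_sync >= t->limit)
		toy_synchronize_rcu(t);
}
```

With a limit of 3, seven updates cause two grace-period waits and leave one update counted toward the next. The limit bounds how much memory can be waiting for a grace period at any time.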
215	9. All RCU list-traversal primitives, which include	245	9. All RCU list-traversal primitives, which include
216	rcu_dereference(), list_for_each_entry_rcu(),	246	rcu_dereference(), list_for_each_entry_rcu(),
217	list_for_each_continue_rcu(), and list_for_each_safe_rcu(),	247	list_for_each_continue_rcu(), and list_for_each_safe_rcu(),
@@ -229,7 +259,8 @@ over a rather long period of time, but improvements are always welcome!
229	10. Conversely, if you are in an RCU read-side critical section,	259	10. Conversely, if you are in an RCU read-side critical section,
230	and you don't hold the appropriate update-side lock, you -must-	260	and you don't hold the appropriate update-side lock, you -must-
231	use the "_rcu()" variants of the list macros. Failing to do so	261	use the "_rcu()" variants of the list macros. Failing to do so
232	will break Alpha and confuse people reading your code.	262	will break Alpha, cause aggressive compilers to generate bad code,
		263	and confuse people trying to read your code.
233		264
234	11. Note that synchronize_rcu() -only- guarantees to wait until	265	11. Note that synchronize_rcu() -only- guarantees to wait until
235	all currently executing rcu_read_lock()-protected RCU read-side	266	all currently executing rcu_read_lock()-protected RCU read-side
@@ -239,15 +270,21 @@ over a rather long period of time, but improvements are always welcome!
239	rcu_read_lock()-protected read-side critical sections, do -not-	270	rcu_read_lock()-protected read-side critical sections, do -not-
240	use synchronize_rcu().	271	use synchronize_rcu().
241		272
242	If you want to wait for some of these other things, you might	273	Similarly, disabling preemption is not an acceptable substitute
243	instead need to use synchronize_irq() or synchronize_sched().	274	for rcu_read_lock(). Code that attempts to use preemption
		275	disabling where it should be using rcu_read_lock() will break
		276	in real-time kernel builds.
		277
		278	If you want to wait for interrupt handlers, NMI handlers, and
		279	code under the influence of preempt_disable(), you instead
		280	need to use synchronize_irq() or synchronize_sched().
244		281
245	12. Any lock acquired by an RCU callback must be acquired elsewhere	282	12. Any lock acquired by an RCU callback must be acquired elsewhere
246	with softirq disabled, e.g., via spin_lock_irqsave(),	283	with softirq disabled, e.g., via spin_lock_irqsave(),
247	spin_lock_bh(), etc. Failing to disable irq on a given	284	spin_lock_bh(), etc. Failing to disable irq on a given
248	acquisition of that lock will result in deadlock as soon as the	285	acquisition of that lock will result in deadlock as soon as
249	RCU callback happens to interrupt that acquisition's critical	286	the RCU softirq handler happens to run your RCU callback while
250	section.	287	interrupting that acquisition's critical section.
251		288
252	13. RCU callbacks can be and are executed in parallel. In many cases,	289	13. RCU callbacks can be and are executed in parallel. In many cases,
253	the callback code is simply a wrapper around kfree(), so that this	290	the callback code is simply a wrapper around kfree(), so that this
@@ -265,29 +302,30 @@ over a rather long period of time, but improvements are always welcome!
265	not the case, a self-spawning RCU callback would prevent the	302	not the case, a self-spawning RCU callback would prevent the
266	victim CPU from ever going offline.)	303	victim CPU from ever going offline.)
267		304
268	14. SRCU (srcu_read_lock(), srcu_read_unlock(), and synchronize_srcu())	305	14. SRCU (srcu_read_lock(), srcu_read_unlock(), synchronize_srcu(),
269	may only be invoked from process context. Unlike other forms of	306	and synchronize_srcu_expedited()) may only be invoked from
270	RCU, it -is- permissible to block in an SRCU read-side critical	307	process context. Unlike other forms of RCU, it -is- permissible
271	section (demarked by srcu_read_lock() and srcu_read_unlock()),	308	to block in an SRCU read-side critical section (demarked by
272	hence the "SRCU": "sleepable RCU". Please note that if you	309	srcu_read_lock() and srcu_read_unlock()), hence the "SRCU":
273	don't need to sleep in read-side critical sections, you should	310	"sleepable RCU". Please note that if you don't need to sleep
274	be using RCU rather than SRCU, because RCU is almost always	311	in read-side critical sections, you should be using RCU rather
275	faster and easier to use than is SRCU.	312	than SRCU, because RCU is almost always faster and easier to
		313	use than is SRCU.
276		314
277	Also unlike other forms of RCU, explicit initialization	315	Also unlike other forms of RCU, explicit initialization
278	and cleanup is required via init_srcu_struct() and	316	and cleanup is required via init_srcu_struct() and
279	cleanup_srcu_struct(). These are passed a "struct srcu_struct"	317	cleanup_srcu_struct(). These are passed a "struct srcu_struct"
280	that defines the scope of a given SRCU domain. Once initialized,	318	that defines the scope of a given SRCU domain. Once initialized,
281	the srcu_struct is passed to srcu_read_lock(), srcu_read_unlock()	319	the srcu_struct is passed to srcu_read_lock(), srcu_read_unlock(),
282	and synchronize_srcu(). A given synchronize_srcu() waits only	320	synchronize_srcu(), and synchronize_srcu_expedited(). A given
283	for SRCU read-side critical sections governed by srcu_read_lock()	321	synchronize_srcu() waits only for SRCU read-side critical
284	and srcu_read_unlock() calls that have been passd the same	322	sections governed by srcu_read_lock() and srcu_read_unlock()
285	srcu_struct. This property is what makes sleeping read-side	323	calls that have been passed the same srcu_struct. This property
286	critical sections tolerable -- a given subsystem delays only	324	is what makes sleeping read-side critical sections tolerable --
287	its own updates, not those of other subsystems using SRCU.	325	a given subsystem delays only its own updates, not those of other
288	Therefore, SRCU is less prone to OOM the system than RCU would	326	subsystems using SRCU. Therefore, SRCU is less prone to OOM the
289	be if RCU's read-side critical sections were permitted to	327	system than RCU would be if RCU's read-side critical sections
290	sleep.	328	were permitted to sleep.
291		329
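The per-domain property described above can be illustrated with a toy, single-threaded userspace model. It is a sketch under loud assumptions: struct toy_srcu and the toy_*() functions are hypothetical, and the single reader count is a simplification that says nothing about how the kernel's srcu_struct is implemented. The model only shows that a reader in one domain delays updaters of that domain alone.

```c
#include <assert.h>
#include <stdbool.h>

/* Toy model of one SRCU domain.  Not the kernel's srcu_struct. */
struct toy_srcu {
	int readers; /* read-side critical sections currently in flight */
};

/* Enter a read-side critical section in domain sp.  Like the real
 * srcu_read_lock(), this returns a value that must be passed to the
 * matching unlock; the toy always returns 0. */
static int toy_srcu_read_lock(struct toy_srcu *sp)
{
	sp->readers++;
	return 0;
}

/* Leave the read-side critical section entered with index idx. */
static void toy_srcu_read_unlock(struct toy_srcu *sp, int idx)
{
	(void)idx; /* unused in this toy model */
	assert(sp->readers > 0);
	sp->readers--;
}

/* True if a synchronize_srcu() on this domain would have to wait for
 * a pre-existing reader. */
static bool toy_srcu_must_wait(const struct toy_srcu *sp)
{
	return sp->readers > 0;
}
```

With two domains a and b, a reader that holds toy_srcu_read_lock(&a) makes toy_srcu_must_wait(&a) true while toy_srcu_must_wait(&b) stays false, which is the isolation between subsystems that the text describes.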
292	The ability to sleep in read-side critical sections does not	330	The ability to sleep in read-side critical sections does not
293	come for free. First, corresponding srcu_read_lock() and	331	come for free. First, corresponding srcu_read_lock() and
@@ -311,12 +349,12 @@ over a rather long period of time, but improvements are always welcome!
311	destructive operation, and -only- -then- invoke call_rcu(),	349	destructive operation, and -only- -then- invoke call_rcu(),
312	synchronize_rcu(), or friends.	350	synchronize_rcu(), or friends.
313		351
314	Because these primitives only wait for pre-existing readers,	352	Because these primitives only wait for pre-existing readers, it
315	it is the caller's responsibility to guarantee safety to	353	is the caller's responsibility to guarantee that any subsequent
316	any subsequent readers.	354	readers will execute safely.
317		355
318	16. The various RCU read-side primitives do -not- contain memory	356	16. The various RCU read-side primitives do -not- necessarily contain
319	barriers. The CPU (and in some cases, the compiler) is free	357	memory barriers. You should therefore plan for the CPU
320	to reorder code into and out of RCU read-side critical sections.	358	and the compiler to freely reorder code into and out of RCU
321	It is the responsibility of the RCU update-side primitives to	359	read-side critical sections. It is the responsibility of the
322	deal with this.	360	RCU update-side primitives to deal with this.