Diffstat (limited to 'Documentation/sysctl/vm.txt')
-rw-r--r-- | Documentation/sysctl/vm.txt | 56 |
1 files changed, 48 insertions, 8 deletions
diff --git a/Documentation/sysctl/vm.txt b/Documentation/sysctl/vm.txt
index 391dd64363e7..a46c10fcddfc 100644
--- a/Documentation/sysctl/vm.txt
+++ b/Documentation/sysctl/vm.txt
@@ -28,6 +28,7 @@ Currently, these files are in /proc/sys/vm: | |||
28 | - block_dump | 28 | - block_dump |
29 | - drop-caches | 29 | - drop-caches |
30 | - zone_reclaim_mode | 30 | - zone_reclaim_mode |
31 | - zone_reclaim_interval | ||
31 | 32 | ||
32 | ============================================================== | 33 | ============================================================== |
33 | 34 | ||
@@ -126,15 +127,54 @@ the high water marks for each per cpu page list. | |||
126 | 127 | ||
127 | zone_reclaim_mode: | 128 | zone_reclaim_mode: |
128 | 129 | ||
129 | This is set during bootup to 1 if it is determined that pages from | 130 | Zone_reclaim_mode allows someone to set more or less aggressive approaches to |
130 | remote zones will cause a significant performance reduction. The | 131 | reclaim memory when a zone runs out of memory. If it is set to zero then no |
132 | zone reclaim occurs. Allocations will be satisfied from other zones / nodes | ||
133 | in the system. | ||
134 | |||
135 | This value is the bitwise OR of the following flags: | ||
136 | |||
137 | 1 = Zone reclaim on | ||
138 | 2 = Zone reclaim writes dirty pages out | ||
139 | 4 = Zone reclaim swaps pages | ||
140 | 8 = Also do a global slab reclaim pass | ||
141 | |||
142 | zone_reclaim_mode is set during bootup to 1 if it is determined that pages | ||
143 | from remote zones will cause a measurable performance reduction. The | ||
131 | page allocator will then reclaim easily reusable pages (those page | 144 | page allocator will then reclaim easily reusable pages (those page |
132 | cache pages that are currently not used) before going off node. | 145 | cache pages that are currently not used) before allocating off node pages. |
146 | |||
147 | It may be beneficial to switch off zone reclaim if the system is | ||
148 | used for a file server and all of memory should be used for caching files | ||
149 | from disk. In that case the caching effect is more important than | ||
150 | data locality. | ||
151 | |||
152 | Allowing zone reclaim to write out pages stops processes that are | ||
153 | writing large amounts of data from dirtying pages on other nodes. Zone | ||
154 | reclaim will write out dirty pages if a zone fills up and so effectively | ||
155 | throttle the process. This may decrease the performance of a single process | ||
156 | since it cannot use all of system memory to buffer the outgoing writes | ||
157 | anymore but it preserves the memory on other nodes so that the performance | ||
158 | of other processes running on other nodes will not be affected. | ||
159 | |||
160 | Allowing regular swap effectively restricts allocations to the local | ||
161 | node unless explicitly overridden by memory policies or cpuset | ||
162 | configurations. | ||
163 | |||
164 | It may be advisable to allow slab reclaim if the system makes heavy | ||
165 | use of files and builds up large slab caches. However, the slab | ||
166 | shrink operation is global, may take a long time and free slabs | ||
167 | in all nodes of the system. | ||
168 | |||
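The bit values listed for zone_reclaim_mode above can be combined and
decoded as in the following minimal sketch. The constant and function names
are hypothetical and chosen only for illustration; only the numeric values
1, 2, 4 and 8 come from the patched text.

```python
# Bit values as listed in the patched vm.txt text. The names are
# hypothetical labels for this sketch, not taken from the patch.
ZONE_RECLAIM_ON = 1      # zone reclaim on
ZONE_RECLAIM_WRITE = 2   # zone reclaim writes dirty pages out
ZONE_RECLAIM_SWAP = 4    # zone reclaim swaps pages
ZONE_RECLAIM_SLAB = 8    # also do a global slab reclaim pass

FLAG_NAMES = {
    ZONE_RECLAIM_ON: "zone reclaim on",
    ZONE_RECLAIM_WRITE: "write dirty pages out",
    ZONE_RECLAIM_SWAP: "swap pages",
    ZONE_RECLAIM_SLAB: "global slab reclaim pass",
}


def encode_mode(*flags):
    """OR the given flags together into one zone_reclaim_mode value."""
    mode = 0
    for flag in flags:
        if flag not in FLAG_NAMES:
            raise ValueError(f"unknown flag: {flag!r}")
        mode |= flag
    return mode


def decode_mode(mode):
    """Return the descriptions of all flags set in mode, lowest bit first."""
    known_bits = encode_mode(*FLAG_NAMES)
    if mode < 0 or mode & ~known_bits:
        raise ValueError(f"mode has bits outside the documented set: {mode}")
    return [name for bit, name in sorted(FLAG_NAMES.items()) if mode & bit]


if __name__ == "__main__":
    mode = encode_mode(ZONE_RECLAIM_ON, ZONE_RECLAIM_WRITE)
    print(mode, decode_mode(mode))
```

A value of 0 decodes to an empty list, which matches the statement that no
zone reclaim occurs when the mode is set to zero.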
169 | ================================================================ | ||
170 | |||
171 | zone_reclaim_interval: | ||
172 | |||
173 | The time allowed for off node allocations after zone reclaim | ||
174 | has failed to reclaim enough pages to allow a local allocation. | ||
133 | 175 | ||
134 | The user can override this setting. It may be beneficial to switch | 176 | The time is given in seconds and defaults to 30 seconds. |
135 | off zone reclaim if the system is used for a file server and all | ||
136 | of memory should be used for caching files from disk. | ||
137 | 177 | ||
138 | It may be beneficial to switch this on if one wants to do zone | 178 | Reduce the interval if undesired off node allocations occur. However, too |
139 | reclaim regardless of the numa distances in the system. | 179 | frequent scans will have a negative impact on off node allocation performance. |
140 | 180 | ||
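As a rough illustration of adjusting the interval described above, the
sketch below reads and writes the value through a file path. It assumes the
tunable appears as /proc/sys/vm/zone_reclaim_interval, which follows from the
file list in vm.txt but only holds on a kernel that carries this patch. The
helper names are hypothetical, and the negative-value check is this sketch's
own sanity test rather than documented kernel behaviour. Writing under
/proc/sys normally requires root.

```python
from pathlib import Path

# Assumed location, derived from "these files are in /proc/sys/vm".
DEFAULT_PATH = Path("/proc/sys/vm/zone_reclaim_interval")


def read_interval(path=DEFAULT_PATH):
    """Return the interval in seconds as stored in the given file."""
    return int(Path(path).read_text().split()[0])


def write_interval(seconds, path=DEFAULT_PATH):
    """Store a new interval in seconds in the given file."""
    if seconds < 0:
        # Sanity check local to this sketch, not documented behaviour.
        raise ValueError("interval must not be negative")
    Path(path).write_text(f"{seconds}\n")


if __name__ == "__main__":
    if DEFAULT_PATH.exists():
        print("zone_reclaim_interval:", read_interval(), "seconds")
    else:
        print("zone_reclaim_interval is not available on this kernel")
```

Passing the path as a parameter keeps the helpers usable against an ordinary
file, which makes it possible to try them without touching a live system.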