Diffstat (limited to 'Documentation/sysctl/vm.txt')
| -rw-r--r-- | Documentation/sysctl/vm.txt | 56 |
1 files changed, 48 insertions, 8 deletions
diff --git a/Documentation/sysctl/vm.txt b/Documentation/sysctl/vm.txt
index 391dd64363e7..a46c10fcddfc 100644
--- a/Documentation/sysctl/vm.txt
+++ b/Documentation/sysctl/vm.txt
| @@ -28,6 +28,7 @@ Currently, these files are in /proc/sys/vm: | |||
| 28 | - block_dump | 28 | - block_dump |
| 29 | - drop-caches | 29 | - drop-caches |
| 30 | - zone_reclaim_mode | 30 | - zone_reclaim_mode |
| 31 | - zone_reclaim_interval | ||
| 31 | 32 | ||
| 32 | ============================================================== | 33 | ============================================================== |
| 33 | 34 | ||
| @@ -126,15 +127,54 @@ the high water marks for each per cpu page list. | |||
| 126 | 127 | ||
| 127 | zone_reclaim_mode: | 128 | zone_reclaim_mode: |
| 128 | 129 | ||
| 129 | This is set during bootup to 1 if it is determined that pages from | 130 | Zone_reclaim_mode allows setting more or less aggressive approaches to |
| 130 | remote zones will cause a significant performance reduction. The | 131 | reclaim memory when a zone runs out of memory. If it is set to zero then no |
| 132 | zone reclaim occurs. Allocations will be satisfied from other zones / nodes | ||
| 133 | in the system. | ||
| 134 | |||
| 135 | This value is the bitwise OR of | ||
| 136 | |||
| 137 | 1 = Zone reclaim on | ||
| 138 | 2 = Zone reclaim writes dirty pages out | ||
| 139 | 4 = Zone reclaim swaps pages | ||
| 140 | 8 = Also do a global slab reclaim pass | ||
| 141 | |||
| 142 | zone_reclaim_mode is set during bootup to 1 if it is determined that pages | ||
| 143 | from remote zones will cause a measurable performance reduction. The | ||
| 131 | page allocator will then reclaim easily reusable pages (those page | 144 | page allocator will then reclaim easily reusable pages (those page |
| 132 | cache pages that are currently not used) before going off node. | 145 | cache pages that are currently not used) before allocating off node pages. |
| 146 | |||
| 147 | It may be beneficial to switch off zone reclaim if the system is | ||
| 148 | used for a file server and all of memory should be used for caching files | ||
| 149 | from disk. In that case the caching effect is more important than | ||
| 150 | data locality. | ||
| 151 | |||
| 152 | Allowing zone reclaim to write out pages stops processes that are | ||
| 153 | writing large amounts of data from dirtying pages on other nodes. Zone | ||
| 154 | reclaim will write out dirty pages if a zone fills up and so effectively | ||
| 155 | throttle the process. This may decrease the performance of a single process | ||
| 156 | since it cannot use all of system memory to buffer the outgoing writes | ||
| 157 | anymore but it preserves the memory on other nodes so that the performance | ||
| 158 | of other processes running on other nodes will not be affected. | ||
| 159 | |||
| 160 | Allowing regular swap effectively restricts allocations to the local | ||
| 161 | node unless explicitly overridden by memory policies or cpuset | ||
| 162 | configurations. | ||
| 163 | |||
| 164 | It may be advisable to allow slab reclaim if the system makes heavy | ||
| 165 | use of files and builds up large slab caches. However, the slab | ||
| 166 | shrink operation is global, may take a long time and free slabs | ||
| 167 | in all nodes of the system. | ||
| 168 | |||
| 169 | ================================================================ | ||
| 170 | |||
| 171 | zone_reclaim_interval: | ||
| 172 | |||
| 173 | The time allowed for off node allocations after zone reclaim | ||
| 174 | has failed to reclaim enough pages to allow a local allocation. | ||
| 133 | 175 | ||
| 134 | The user can override this setting. It may be beneficial to switch | 176 | Time is set in seconds and set by default to 30 seconds. |
| 135 | off zone reclaim if the system is used for a file server and all | ||
| 136 | of memory should be used for caching files from disk. | ||
| 137 | 177 | ||
| 138 | It may be beneficial to switch this on if one wants to do zone | 178 | Reduce the interval if undesired off node allocations occur. However, too |
| 139 | reclaim regardless of the numa distances in the system. | 179 | frequent scans will have a negative impact on off node allocation performance. |
| 140 | 180 | ||
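
The zone_reclaim_mode values added by this patch (1, 2, 4, 8) combine as a bitmask, so a setting such as 3 means "zone reclaim on" plus "write dirty pages out". The following is a minimal Python sketch of decoding such a value. Only the four flag values, their descriptions, and the /proc/sys/vm location come from the patched text; the names ZONE_RECLAIM_FLAGS, decode_zone_reclaim_mode and read_zone_reclaim_mode are hypothetical helpers for illustration, not kernel interfaces.

```python
# Flag values and meanings as listed in the patched vm.txt.
# The dictionary and function names are illustrative assumptions.
ZONE_RECLAIM_FLAGS = {
    1: "zone reclaim on",
    2: "zone reclaim writes dirty pages out",
    4: "zone reclaim swaps pages",
    8: "also do a global slab reclaim pass",
}


def decode_zone_reclaim_mode(value):
    """Return the descriptions of the flags set in value.

    An empty list corresponds to 0, which the documentation
    describes as "no zone reclaim occurs".
    """
    known_bits = sum(ZONE_RECLAIM_FLAGS)  # 1 | 2 | 4 | 8 == 15
    if value < 0 or value & ~known_bits:
        raise ValueError(f"value {value} has bits outside the documented flags")
    return [desc for bit, desc in ZONE_RECLAIM_FLAGS.items() if value & bit]


def read_zone_reclaim_mode(path="/proc/sys/vm/zone_reclaim_mode"):
    """Read the current setting from the sysctl file.

    Assumes the file holds a single decimal integer.
    """
    with open(path) as f:
        return int(f.read().strip())
```

For example, decode_zone_reclaim_mode(1 | 2) lists the first two flags, and decode_zone_reclaim_mode(0) returns an empty list. Reading the live value with read_zone_reclaim_mode() only works on a system that exposes this sysctl file.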
