memory-hotplug: introduce new arch_remove_memory() for removing page table
[GitHub/mt8127/android_kernel_alcatel_ttab.git] / mm / Kconfig
Commit Line Data
e1785e85
DH
1config SELECT_MEMORY_MODEL
2 def_bool y
a8826eeb 3 depends on ARCH_SELECT_MEMORY_MODEL
e1785e85 4
3a9da765
DH
5choice
6 prompt "Memory model"
e1785e85
DH
7 depends on SELECT_MEMORY_MODEL
8 default DISCONTIGMEM_MANUAL if ARCH_DISCONTIGMEM_DEFAULT
d41dee36 9 default SPARSEMEM_MANUAL if ARCH_SPARSEMEM_DEFAULT
e1785e85 10 default FLATMEM_MANUAL
3a9da765 11
e1785e85 12config FLATMEM_MANUAL
3a9da765 13 bool "Flat Memory"
c898ec16 14 depends on !(ARCH_DISCONTIGMEM_ENABLE || ARCH_SPARSEMEM_ENABLE) || ARCH_FLATMEM_ENABLE
3a9da765
DH
15 help
16 This option allows you to change some of the ways that
17 Linux manages its memory internally. Most users will
18 only have one option here: FLATMEM. This is normal
19 and a correct option.
20
d41dee36
AW
21 Some users of more advanced features like NUMA and
22 memory hotplug may have different options here.
23 DISCONTIGMEM is a more mature, better tested system,
24 but is incompatible with memory hotplug and may suffer
25 decreased performance over SPARSEMEM. If unsure between
26 "Sparse Memory" and "Discontiguous Memory", choose
27 "Discontiguous Memory".
28
29 If unsure, choose this option (Flat Memory) over any other.
3a9da765 30
e1785e85 31config DISCONTIGMEM_MANUAL
f3519f91 32 bool "Discontiguous Memory"
3a9da765
DH
33 depends on ARCH_DISCONTIGMEM_ENABLE
34 help
785dcd44
DH
35 This option provides enhanced support for discontiguous
36 memory systems, over FLATMEM. These systems have holes
37 in their physical address spaces, and this option provides
38 more efficient handling of these holes. However, the vast
39 majority of hardware has quite flat address spaces, and
ad3d0a38 40 can have degraded performance from the extra overhead that
785dcd44
DH
41 this option imposes.
42
43 Many NUMA configurations will have this as the only option.
44
3a9da765
DH
45 If unsure, choose "Flat Memory" over this option.
46
d41dee36
AW
47config SPARSEMEM_MANUAL
48 bool "Sparse Memory"
49 depends on ARCH_SPARSEMEM_ENABLE
50 help
51 This will be the only option for some systems, including
52 memory hotplug systems. This is normal.
53
54 For many other systems, this will be an alternative to
f3519f91 55 "Discontiguous Memory". This option provides some potential
d41dee36
AW
56 performance benefits, along with decreased code complexity,
57 but it is newer, and more experimental.
58
59 If unsure, choose "Discontiguous Memory" or "Flat Memory"
60 over this option.
61
3a9da765
DH
62endchoice
63
e1785e85
DH
64config DISCONTIGMEM
65 def_bool y
66 depends on (!SELECT_MEMORY_MODEL && ARCH_DISCONTIGMEM_ENABLE) || DISCONTIGMEM_MANUAL
67
d41dee36
AW
68config SPARSEMEM
69 def_bool y
1a83e175 70 depends on (!SELECT_MEMORY_MODEL && ARCH_SPARSEMEM_ENABLE) || SPARSEMEM_MANUAL
d41dee36 71
e1785e85
DH
72config FLATMEM
73 def_bool y
d41dee36
AW
74 depends on (!DISCONTIGMEM && !SPARSEMEM) || FLATMEM_MANUAL
75
76config FLAT_NODE_MEM_MAP
77 def_bool y
78 depends on !SPARSEMEM
e1785e85 79
93b7504e
DH
80#
81# Both the NUMA code and DISCONTIGMEM use arrays of pg_data_t's
82# to represent different areas of memory. This variable allows
83# those dependencies to exist individually.
84#
85config NEED_MULTIPLE_NODES
86 def_bool y
87 depends on DISCONTIGMEM || NUMA
af705362
AW
88
89config HAVE_MEMORY_PRESENT
90 def_bool y
d41dee36 91 depends on ARCH_HAVE_MEMORY_PRESENT || SPARSEMEM
802f192e 92
3e347261
BP
93#
94# SPARSEMEM_EXTREME (which is the default) does some bootmem
84eb8d06 95# allocations when memory_present() is called. If this cannot
3e347261
BP
96# be done on your architecture, select this option. However,
97# statically allocating the mem_section[] array can potentially
98# consume vast quantities of .bss, so be careful.
99#
100# This option will also potentially produce smaller runtime code
101# with gcc 3.4 and later.
102#
103config SPARSEMEM_STATIC
9ba16087 104 bool
3e347261 105
802f192e 106#
44c09201 107# Architecture platforms which require a two level mem_section in SPARSEMEM
802f192e
BP
108# must select this option. This is usually for architecture platforms with
109# an extremely sparse physical address space.
110#
3e347261
BP
111config SPARSEMEM_EXTREME
112 def_bool y
113 depends on SPARSEMEM && !SPARSEMEM_STATIC
4c21e2f2 114
29c71111 115config SPARSEMEM_VMEMMAP_ENABLE
9ba16087 116 bool
29c71111 117
9bdac914
YL
118config SPARSEMEM_ALLOC_MEM_MAP_TOGETHER
119 def_bool y
120 depends on SPARSEMEM && X86_64
121
29c71111 122config SPARSEMEM_VMEMMAP
a5ee6daa
GL
123 bool "Sparse Memory virtual memmap"
124 depends on SPARSEMEM && SPARSEMEM_VMEMMAP_ENABLE
125 default y
126 help
127 SPARSEMEM_VMEMMAP uses a virtually mapped memmap to optimise
128 pfn_to_page and page_to_pfn operations. This is the most
129 efficient option when sufficient kernel resources are available.
29c71111 130
95f72d1e
YL
131config HAVE_MEMBLOCK
132 boolean
133
7c0caeb8
TH
134config HAVE_MEMBLOCK_NODE_MAP
135 boolean
136
c378ddd5
TH
137config ARCH_DISCARD_MEMBLOCK
138 boolean
139
66616720
SR
140config NO_BOOTMEM
141 boolean
142
ee6f509c
MK
143config MEMORY_ISOLATION
144 boolean
145
20b2f52b
LJ
146config MOVABLE_NODE
147 boolean "Enable to assign a node which has only movable memory"
148 depends on HAVE_MEMBLOCK
149 depends on NO_BOOTMEM
150 depends on X86_64
151 depends on NUMA
c2974058
TC
152 default n
153 help
154 Allow a node to have only movable memory. Pages used by the kernel,
155 such as direct mapping pages, cannot be migrated. So the corresponding
156 memory device cannot be hotplugged. This option allows users to
157 online all the memory of a node as movable memory so that the whole
158 node can be hotplugged. Users who don't use the memory hotplug
159 feature are fine with this option on since they don't online memory
160 as movable.
161
162 Say Y here if you want to hotplug a whole node.
163 Say N here if you want the kernel to use memory on all nodes evenly.
20b2f52b 164
3947be19
DH
165# eventually, we can have this option just 'select SPARSEMEM'
166config MEMORY_HOTPLUG
167 bool "Allow for memory hot-add"
ee6f509c 168 select MEMORY_ISOLATION
ec69acbb 169 depends on SPARSEMEM || X86_64_ACPI_NUMA
6ad696d2 170 depends on HOTPLUG && ARCH_ENABLE_MEMORY_HOTPLUG
ed84a07a 171 depends on (IA64 || X86 || PPC_BOOK3S_64 || SUPERH || S390)
3947be19 172
ec69acbb
KM
173config MEMORY_HOTPLUG_SPARSE
174 def_bool y
175 depends on SPARSEMEM && MEMORY_HOTPLUG
176
0c0e6195
KH
177config MEMORY_HOTREMOVE
178 bool "Allow for memory hot remove"
179 depends on MEMORY_HOTPLUG && ARCH_ENABLE_MEMORY_HOTREMOVE
180 depends on MIGRATION
181
e20b8cca
CL
182#
183# If we have space for more page flags then we can enable additional
184# optimizations and functionality.
185#
186# Regular Sparsemem takes page flag bits for the sectionid if it does not
187# use a virtual memmap. Disable extended page flags for 32 bit platforms
188# that require the use of a sectionid in the page flags.
189#
190config PAGEFLAGS_EXTENDED
191 def_bool y
a269cca9 192 depends on 64BIT || SPARSEMEM_VMEMMAP || !SPARSEMEM
e20b8cca 193
4c21e2f2
HD
194# Heavily threaded applications may benefit from splitting the mm-wide
195# page_table_lock, so that faults on different parts of the user address
196# space can be handled with less contention: split it at this NR_CPUS.
197# Default to 4 for wider testing, though 8 might be more appropriate.
198# ARM's adjust_pte (unused if VIPT) depends on mm-wide page_table_lock.
7b6ac9df 199# PA-RISC 7xxx's spinlock_t would enlarge struct page from 32 to 44 bytes.
a70caa8b 200# DEBUG_SPINLOCK and DEBUG_LOCK_ALLOC spinlock_t also enlarge struct page.
4c21e2f2
HD
201#
202config SPLIT_PTLOCK_CPUS
203 int
a70caa8b
HD
204 default "999999" if ARM && !CPU_CACHE_VIPT
205 default "999999" if PARISC && !PA20
206 default "999999" if DEBUG_SPINLOCK || DEBUG_LOCK_ALLOC
4c21e2f2 207 default "4"
7cbe34cf 208
18468d93
RA
209#
210# support for memory balloon compaction
211config BALLOON_COMPACTION
212 bool "Allow for balloon memory compaction/migration"
213 def_bool y
214 depends on COMPACTION && VIRTIO_BALLOON
215 help
216 Memory fragmentation introduced by ballooning might significantly
217 reduce the number of 2MB contiguous memory blocks that can be
218 used within a guest, thus imposing performance penalties associated
219 with the reduced number of transparent huge pages that could be used
220 by the guest workload. Allowing compaction and migration of memory
221 pages enlisted as part of memory balloon devices avoids this
222 scenario and helps improve memory defragmentation.
223
e9e96b39
MG
224#
225# support for memory compaction
226config COMPACTION
227 bool "Allow for memory compaction"
05106e6a 228 def_bool y
e9e96b39 229 select MIGRATION
33a93877 230 depends on MMU
e9e96b39
MG
231 help
232 Allows the compaction of memory for the allocation of huge pages.
233
7cbe34cf
CL
234#
235# support for page migration
236#
237config MIGRATION
b20a3503 238 bool "Page migration"
6c5240ae 239 def_bool y
47118af0 240 depends on NUMA || ARCH_ENABLE_MEMORY_HOTREMOVE || COMPACTION || CMA
b20a3503
CL
241 help
242 Allows the migration of the physical location of pages of processes
e9e96b39
MG
243 while the virtual addresses are not changed. This is useful in
244 two situations. The first is on NUMA systems to put pages nearer
245 to the processors accessing them. The second is when allocating huge
246 pages as migration can relocate pages to satisfy a huge page
247 allocation instead of reclaiming.
6550e07f 248
600715dc
JF
249config PHYS_ADDR_T_64BIT
250 def_bool 64BIT || ARCH_PHYS_ADDR_T_64BIT
251
4b51d669
CL
252config ZONE_DMA_FLAG
253 int
254 default "0" if !ZONE_DMA
255 default "1"
256
2a7326b5
CL
257config BOUNCE
258 def_bool y
259 depends on BLOCK && MMU && (ZONE_DMA || HIGHMEM)
260
ffecfd1a
DW
261# On the 'tile' arch, USB OHCI needs the bounce pool since tilegx will often
262# have more than 4GB of memory, but we don't currently use the IOTLB to present
263# a 32-bit address to OHCI. So we need to use a bounce pool instead.
264#
265# We also use the bounce pool to provide stable page writes for jbd. jbd
266# initiates buffer writeback without locking the page or setting PG_writeback,
267# and fixing that behavior (a second time; jbd2 doesn't have this problem) is
268# a major rework effort. Instead, use the bounce buffer to snapshot pages
269# (until jbd goes away). The only jbd user is ext3.
270config NEED_BOUNCE_POOL
271 bool
272 default y if (TILE && USB_OHCI_HCD) || (BLK_DEV_INTEGRITY && JBD)
273
6225e937
CL
274config NR_QUICK
275 int
276 depends on QUICKLIST
0176bd3d 277 default "2" if AVR32
6225e937 278 default "1"
f057eac0
SR
279
280config VIRT_TO_BUS
281 def_bool y
282 depends on !ARCH_NO_VIRT_TO_BUS
cddb8a5c
AA
283
284config MMU_NOTIFIER
285 bool
fc4d5c29 286
f8af4da3
HD
287config KSM
288 bool "Enable KSM for page merging"
289 depends on MMU
290 help
291 Enable Kernel Samepage Merging: KSM periodically scans those areas
292 of an application's address space that an app has advised may be
293 mergeable. When it finds pages of identical content, it replaces
d0f209f6 294 the many instances by a single page with that content, so
f8af4da3
HD
295 saving memory until one or another app needs to modify the content.
296 Recommended for use with KVM, or with other duplicative applications.
c73602ad
HD
297 See Documentation/vm/ksm.txt for more information: KSM is inactive
298 until a program has madvised that an area is MADV_MERGEABLE, and
299 root has set /sys/kernel/mm/ksm/run to 1 (if CONFIG_SYSFS is set).
f8af4da3 300
e0a94c2a
CL
301config DEFAULT_MMAP_MIN_ADDR
302 int "Low address space to protect from user allocation"
6e141546 303 depends on MMU
e0a94c2a
CL
304 default 4096
305 help
306 This is the portion of low virtual memory which should be protected
307 from userspace allocation. Keeping a user from writing to low pages
308 can help reduce the impact of kernel NULL pointer bugs.
309
310 For most ia64, ppc64 and x86 users with lots of address space
311 a value of 65536 is reasonable and should cause no problems.
312 On arm and other archs it should not be higher than 32768.
788084ab
EP
313 Programs which use vm86 functionality or have some need to map
314 this low address space will need CAP_SYS_RAWIO, or this
315 protection must be disabled by setting the value to 0.
e0a94c2a
CL
316
317 This value can be changed after boot using the
318 /proc/sys/vm/mmap_min_addr tunable.
319
d949f36f
LT
320config ARCH_SUPPORTS_MEMORY_FAILURE
321 bool
e0a94c2a 322
6a46079c
AK
323config MEMORY_FAILURE
324 depends on MMU
d949f36f 325 depends on ARCH_SUPPORTS_MEMORY_FAILURE
6a46079c 326 bool "Enable recovery from hardware memory errors"
ee6f509c 327 select MEMORY_ISOLATION
6a46079c
AK
328 help
329 Enables code to recover from some memory failures on systems
330 with MCA recovery. This allows a system to continue running
331 even when some of its memory has uncorrected errors. This requires
332 special hardware support and typically ECC memory.
333
cae681fc 334config HWPOISON_INJECT
413f9efb 335 tristate "HWPoison pages injector"
27df5068 336 depends on MEMORY_FAILURE && DEBUG_KERNEL && PROC_FS
478c5ffc 337 select PROC_PAGE_MONITOR
cae681fc 338
fc4d5c29
DH
339config NOMMU_INITIAL_TRIM_EXCESS
340 int "Turn on mmap() excess space trimming before booting"
341 depends on !MMU
342 default 1
343 help
344 The NOMMU mmap() frequently needs to allocate large contiguous chunks
345 of memory on which to store mappings, but it can only ask the system
346 allocator for chunks in 2^N*PAGE_SIZE amounts - which is frequently
347 more than it requires. To deal with this, mmap() is able to trim off
348 the excess and return it to the allocator.
349
350 If trimming is enabled, the excess is trimmed off and returned to the
351 system allocator, which can cause extra fragmentation, particularly
352 if there are a lot of transient processes.
353
354 If trimming is disabled, the excess is kept, but not used, which for
355 long-term mappings means that the space is wasted.
356
357 Trimming can be dynamically controlled through a sysctl option
358 (/proc/sys/vm/nr_trim_pages) which specifies the minimum number of
359 excess pages there must be before trimming should occur, or zero if
360 no trimming is to occur.
361
362 This option specifies the initial value of that sysctl. The default
363 of 1 says that all excess pages should be trimmed.
364
365 See Documentation/nommu-mmap.txt for more information.
bbddff05 366
4c76d9d1 367config TRANSPARENT_HUGEPAGE
13ece886 368 bool "Transparent Hugepage Support"
15626062 369 depends on HAVE_ARCH_TRANSPARENT_HUGEPAGE
5d689240 370 select COMPACTION
4c76d9d1
AA
371 help
372 Transparent Hugepages allows the kernel to use huge pages and
373 huge tlb transparently to the applications whenever possible.
374 This feature can improve computing performance for certain
375 applications by speeding up page faults during memory
376 allocation, by reducing the number of tlb misses and by speeding
377 up the pagetable walking.
378
379 If memory is constrained on an embedded system, you may want to say N.
380
13ece886
AA
381choice
382 prompt "Transparent Hugepage Support sysfs defaults"
383 depends on TRANSPARENT_HUGEPAGE
384 default TRANSPARENT_HUGEPAGE_ALWAYS
385 help
386 Selects the sysfs defaults for Transparent Hugepage Support.
387
388 config TRANSPARENT_HUGEPAGE_ALWAYS
389 bool "always"
390 help
391 Always enabling Transparent Hugepage can increase the
392 memory footprint of applications without a guaranteed
393 benefit, but it will work automatically for all applications.
394
395 config TRANSPARENT_HUGEPAGE_MADVISE
396 bool "madvise"
397 help
398 Enabling Transparent Hugepage only for madvise regions provides a
399 performance benefit only to the applications using
400 madvise(MADV_HUGEPAGE), but it won't risk increasing the
401 memory footprint of applications without a guaranteed
402 benefit.
403endchoice
404
5febcbe9
CY
405config CROSS_MEMORY_ATTACH
406 bool "Cross Memory Support"
407 depends on MMU
408 default y
409 help
410 Enabling this option adds the system calls process_vm_readv and
411 process_vm_writev which allow a process with the correct privileges
412 to directly read from or write to another process's address space.
413 See the man page for more details.
414
bbddff05
TH
415#
416# UP and nommu archs use km based percpu allocator
417#
418config NEED_PER_CPU_KM
419 depends on !SMP
420 bool
421 default y
077b1f83
DM
422
423config CLEANCACHE
424 bool "Enable cleancache driver to cache clean pages if tmem is present"
425 default n
426 help
427 Cleancache can be thought of as a page-granularity victim cache
428 for clean pages that the kernel's pageframe replacement algorithm
429 (PFRA) would like to keep around, but can't since there isn't enough
430 memory. So when the PFRA "evicts" a page, it first attempts to use
140a1ef2 431 cleancache code to put the data contained in that page into
077b1f83
DM
432 "transcendent memory", memory that is not directly accessible or
433 addressable by the kernel and is of unknown and possibly
434 time-varying size. And when a cleancache-enabled
435 filesystem wishes to access a page in a file on disk, it first
436 checks cleancache to see if it already contains it; if it does,
437 the page is copied into the kernel and a disk access is avoided.
438 When a transcendent memory driver is available (such as zcache or
439 Xen transcendent memory), a significant I/O reduction
440 may be achieved. When none is available, all cleancache calls
441 are reduced to a single pointer-compare-against-NULL resulting
442 in a negligible performance hit.
443
444 If unsure, say Y to enable cleancache.
27c6aec2
DM
445
446config FRONTSWAP
447 bool "Enable frontswap to cache swap pages if tmem is present"
448 depends on SWAP
449 default n
450 help
451 Frontswap is so named because it can be thought of as the opposite
452 of a "backing" store for a swap device. The data is stored into
453 "transcendent memory", memory that is not directly accessible or
454 addressable by the kernel and is of unknown and possibly
455 time-varying size. When space in transcendent memory is available,
456 a significant swap I/O reduction may be achieved. When none is
457 available, all frontswap calls are reduced to a single pointer-
458 compare-against-NULL resulting in a negligible performance hit
459 and swap data is stored as normal on the matching swap device.
460
461 If unsure, say Y to enable frontswap.