kernel_samsung_a53x/mm
Vlastimil Babka b4cd121105 mm, vmscan: prevent infinite loop for costly GFP_NOIO | __GFP_RETRY_MAYFAIL allocations
commit 803de9000f334b771afacb6ff3e78622916668b0 upstream.

Sven reports an infinite loop in __alloc_pages_slowpath() for costly order
__GFP_RETRY_MAYFAIL allocations that are also GFP_NOIO.  Such combination
can happen in a suspend/resume context where a GFP_KERNEL allocation can
have __GFP_IO masked out via gfp_allowed_mask.

Quoting Sven:

1. try to do a "costly" allocation (order > PAGE_ALLOC_COSTLY_ORDER)
   with __GFP_RETRY_MAYFAIL set.

2. page alloc's __alloc_pages_slowpath tries to get a page from the
   freelist. This fails because there is nothing free of that costly
   order.

3. page alloc tries to reclaim by calling __alloc_pages_direct_reclaim,
   which bails out because a zone is ready to be compacted; it pretends
   to have made a single page of progress.

4. page alloc tries to compact, but this always bails out early because
   __GFP_IO is not set (it's not passed by the snd allocator, and even
   if it were, we are suspending so the __GFP_IO flag would be cleared
   anyway).

5. page alloc believes reclaim progress was made (because of the
   pretense in item 3) and so it checks whether it should retry
   compaction. The compaction retry logic thinks it should try again,
   because:
    a) reclaim is needed because of the early bail-out in item 4
    b) a zonelist is suitable for compaction

6. goto 2. indefinite stall.

(end quote)
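
The cycle Sven describes can be sketched as a small userspace toy model. This is not kernel code: every name prefixed with toy_ is hypothetical, the pass cap exists only so the demonstration terminates, and the assumption that a real reclaim attempt frees nothing is a simplification for illustration.

```c
#include <assert.h>
#include <stdbool.h>

/* Hypothetical stand-in for the compaction outcome relevant to step 4. */
enum toy_compact_result { TOY_COMPACT_SKIPPED, TOY_COMPACT_RAN };

/*
 * Step 3: before the fix, reclaim bails out because a zone looks ready for
 * compaction and reports one page of pretend progress. After the fix it
 * really tries; this toy assumes that attempt frees nothing.
 */
static unsigned long toy_reclaim_progress(bool fixed)
{
	return fixed ? 0 : 1;
}

/*
 * Count slowpath passes for a costly, no-I/O, retry-may-fail request.
 * Returns cap when the loop would never exit on its own.
 */
static int toy_noio_costly_passes(bool fixed, int cap)
{
	int passes = 0;

	assert(cap > 0);
	while (passes < cap) {
		passes++;	/* step 2: the freelist has nothing of this order */

		unsigned long progress = toy_reclaim_progress(fixed);
		/* Step 4: without I/O, compaction always bails out early. */
		enum toy_compact_result res = TOY_COMPACT_SKIPPED;

		if (fixed)
			break;	/* compaction impossible: fail instead of retrying */
		if (progress > 0 && res == TOY_COMPACT_SKIPPED)
			continue;	/* step 5: retry logic says "try again" */
		break;
	}
	return passes;
}
```

In this model, calling toy_noio_costly_passes(false, 1000) runs into the cap, which corresponds to the indefinite stall in step 6, while toy_noio_costly_passes(true, 1000) gives up after a single pass.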

The immediate root cause is that the COMPACT_SKIPPED returned from
__alloc_pages_direct_compact() (step 4) due to the lack of __GFP_IO is
mistaken for an indication that order-0 pages are lacking, and in step 5
should_compact_retry() evaluates that as a reason to retry, before
incrementing and limiting the number of retries.  There are, however, other
places that wrongly assume that compaction can happen while we lack __GFP_IO.

To fix this, introduce gfp_compaction_allowed() to abstract the __GFP_IO
evaluation and switch the open-coded test in try_to_compact_pages() to use
it.
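
As a rough illustration of what such a helper checks, here is a userspace sketch. The toy_ names, the flag bit values, and the TOY_COMPACTION_ENABLED switch are assumptions made for this example; they are stand-ins, not the kernel's definitions.

```c
#include <assert.h>
#include <stdbool.h>

typedef unsigned int toy_gfp_t;

/* Illustrative bit values only; the real flags are defined by the kernel. */
#define TOY_GFP_RECLAIM	0x1u
#define TOY_GFP_IO	0x2u
#define TOY_GFP_FS	0x4u
#define TOY_GFP_NOIO	(TOY_GFP_RECLAIM)
#define TOY_GFP_KERNEL	(TOY_GFP_RECLAIM | TOY_GFP_IO | TOY_GFP_FS)

/* Stand-in for a build-time "compaction is compiled in" switch. */
#define TOY_COMPACTION_ENABLED 1

/* Compaction is considered possible only when the I/O bit is present. */
static inline bool toy_gfp_compaction_allowed(toy_gfp_t gfp_mask)
{
	return TOY_COMPACTION_ENABLED && (gfp_mask & TOY_GFP_IO) != 0;
}

/* Usage: a GFP_KERNEL-like request loses its I/O bit once it is masked. */
void toy_suspend_example(void)
{
	/* Models an allowed-mask that has the I/O and FS bits cleared. */
	toy_gfp_t allowed = ~(TOY_GFP_IO | TOY_GFP_FS);

	assert(toy_gfp_compaction_allowed(TOY_GFP_KERNEL));
	assert(!toy_gfp_compaction_allowed(TOY_GFP_KERNEL & allowed));
}
```

Keeping the test in one helper means every caller answers "can we compact?" the same way, instead of each site open-coding its own flag check.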

Also use the new helper in:
- compaction_ready(), which will make reclaim not bail out in step 3, so
  there's at least one attempt to actually reclaim, even if chances are
  small for a costly order
- in_reclaim_compaction(), which will make should_continue_reclaim()
  return false so that we don't over-reclaim unnecessarily
- __alloc_pages_slowpath(), to set a local variable can_compact,
  which is then used to avoid retrying reclaim/compaction for costly
  allocations (step 5) if we can't compact, and also to skip the early
  compaction attempt that we do in some cases
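
The role of can_compact in the slowpath can be sketched as two small predicates. This is a simplified userspace model under stated assumptions: the toy_ names and struct are hypothetical, TOY_COSTLY_ORDER is a stand-in for PAGE_ALLOC_COSTLY_ORDER, and only the costly-order case of the early compaction attempt is modelled.

```c
#include <assert.h>
#include <stdbool.h>

/* Stand-in for PAGE_ALLOC_COSTLY_ORDER; orders above it are "costly". */
#define TOY_COSTLY_ORDER 3

struct toy_request {
	unsigned int order;
	bool retry_mayfail;	/* models __GFP_RETRY_MAYFAIL being set */
	bool can_compact;	/* models the result of the new helper */
};

/*
 * Step 5 gate: a costly request may keep retrying only if it asked for
 * retry-may-fail semantics AND compaction is actually possible.
 * Non-costly requests are not restricted by this particular check.
 */
static bool toy_costly_check_allows_retry(const struct toy_request *req)
{
	assert(req);
	if (req->order <= TOY_COSTLY_ORDER)
		return true;
	return req->can_compact && req->retry_mayfail;
}

/* Early compaction attempt: skipped entirely when we cannot compact. */
static bool toy_try_early_compaction(const struct toy_request *req,
				     bool can_direct_reclaim)
{
	assert(req);
	return can_direct_reclaim && req->can_compact &&
	       req->order > TOY_COSTLY_ORDER;
}
```

For a request such as { .order = 4, .retry_mayfail = true, .can_compact = false }, both predicates return false in this model, so the allocation fails promptly instead of looping.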

Link: https://lkml.kernel.org/r/20240221114357.13655-2-vbabka@suse.cz
Fixes: 3250845d0526 ("Revert "mm, oom: prevent premature OOM killer invocation for high order request"")
Signed-off-by: Vlastimil Babka <vbabka@suse.cz>
Reported-by: Sven van Ashbrook <svenva@chromium.org>
Closes: https://lore.kernel.org/all/CAG-rBihs_xMKb3wrMO1%2B-%2Bp4fowP9oy1pa_OTkfxBzPUVOZF%2Bg@mail.gmail.com/
Tested-by: Karthikeyan Ramasubramanian <kramasub@chromium.org>
Cc: Brian Geffon <bgeffon@google.com>
Cc: Curtis Malainey <cujomalainey@chromium.org>
Cc: Jaroslav Kysela <perex@perex.cz>
Cc: Mel Gorman <mgorman@techsingularity.net>
Cc: Michal Hocko <mhocko@kernel.org>
Cc: Takashi Iwai <tiwai@suse.com>
Cc: <stable@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
Signed-off-by: Vlastimil Babka <vbabka@suse.cz>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
2024-11-19 09:22:46 +01:00
damon Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
kasan Revert "kasan: print the original fault addr when access invalid shadow" 2024-11-18 11:06:01 +01:00
kfence Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
sec_mm Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
backing-dev.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
balloon_compaction.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
cleancache.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
cma.c mm/cma: use nth_page() in place of direct struct page manipulation 2024-11-18 11:43:25 +01:00
cma.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
cma_debug.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
cma_sysfs.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
compaction.c mm, vmscan: prevent infinite loop for costly GFP_NOIO | __GFP_RETRY_MAYFAIL allocations 2024-11-19 09:22:46 +01:00
debug.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
debug_page_ref.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
debug_vm_pgtable.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
dmapool.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
early_ioremap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
fadvise.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
failslab.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
filemap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
frame_vector.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
frontswap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
gup.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
gup_benchmark.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
highmem.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
hmm.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
hpa.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
huge_memory.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
hugetlb.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
hugetlb_cgroup.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
hwpoison-inject.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
init-mm.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
internal.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
interval_tree.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
io_record.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
ioremap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
Kconfig Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
Kconfig.debug Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
khugepaged.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
kmemleak.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
ksm.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
kzerod.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
list_lru.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
maccess.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
madvise.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
Makefile Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mapping_dirty_helpers.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
memblock.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
memcontrol.c mm: kmem: drop __GFP_NOFAIL when allocating objcg vectors 2024-11-18 11:43:31 +01:00
memfd.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
memory-failure.c mm/memory-failure: fix an incorrect use of tail pages 2024-11-19 09:22:41 +01:00
memory.c mm: fix unmap_mapping_range high bits shift bug 2024-11-18 12:12:08 +01:00
memory_hotplug.c mm/memory_hotplug: use pfn math in place of direct struct page manipulation 2024-11-18 11:43:25 +01:00
mempolicy.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mempool.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
memremap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
memtest.c memtest: use {READ,WRITE}_ONCE in memory scanning 2024-11-19 09:22:36 +01:00
migrate.c mm/migrate: set swap entry values of THP tail pages properly. 2024-11-19 09:22:41 +01:00
mincore.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mlock.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mm_init.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mmap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mmu_gather.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mmu_notifier.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mmzone.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mprotect.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
mremap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
msync.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
nommu.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
oom_kill.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page-writeback.c mm/writeback: fix possible divide-by-zero in wb_dirty_limits(), again 2024-11-18 12:13:28 +01:00
page_alloc.c mm, vmscan: prevent infinite loop for costly GFP_NOIO | __GFP_RETRY_MAYFAIL allocations 2024-11-19 09:22:46 +01:00
page_counter.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_ext.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_idle.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_io.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_isolation.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_owner.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_pinner.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_poison.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_reporting.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_reporting.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
page_vma_mapped.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
pagewalk.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
percpu-internal.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
percpu-km.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
percpu-stats.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
percpu-vm.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
percpu.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
pgalloc-track.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
pgtable-generic.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
process_vm_access.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
ptdump.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
readahead.c vfs: fix readahead(2) on block devices 2024-11-18 11:42:46 +01:00
rmap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
rodata_test.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
shmem.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
shuffle.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
shuffle.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
slab.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
slab.h Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
slab_common.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
slob.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
slub.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
sparse-vmemmap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
sparse.c mm/sparsemem: fix race in accessing memory_section->usage 2024-11-18 12:13:06 +01:00
swap.c mm: swap - set page_cluster at 0 2024-11-17 17:37:44 +01:00
swap_cgroup.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
swap_slots.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
swap_state.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
swapfile.c mm: swap: fix race between free_swap_and_cache() and swapoff() 2024-11-19 09:22:33 +01:00
truncate.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
usercopy.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
userfaultfd.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
util.c mm: vmalloc: introduce array allocation functions 2024-11-18 12:12:59 +01:00
vmacache.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
vmalloc.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
vmpressure.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
vmscan.c mm, vmscan: prevent infinite loop for costly GFP_NOIO | __GFP_RETRY_MAYFAIL allocations 2024-11-19 09:22:46 +01:00
vmstat.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
workingset.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
z3fold.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
zbud.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
zpool.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00
zsmalloc.c zsmalloc: use copy_page for full page copy 2024-11-17 17:41:35 +01:00
zswap.c Import A536BXXU9EXDC 2024-06-15 16:02:09 -03:00