mm/khugepaged: invoke MMU notifiers in shmem/file collapse paths
author: Jann Horn <jannh@google.com>
Fri, 25 Nov 2022 21:37:14 +0000 (22:37 +0100)
committer: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
Wed, 18 Jan 2023 08:26:04 +0000 (09:26 +0100)
commit f268f6cf875f3220afc77bdd0bf1bb136eb54db9 upstream.

Any codepath that zaps page table entries must invoke MMU notifiers to
ensure that secondary MMUs (like KVM) don't keep accessing pages which
aren't mapped anymore.  Secondary MMUs don't hold their own references to
pages that are mirrored over, so failing to notify them can lead to page
use-after-free.

I'm marking this as addressing an issue introduced in commit f3f0e1d2150b
("khugepaged: add support of collapse for tmpfs/shmem pages"), but most of
the security impact of this only came in commit 27e1f8273113 ("khugepaged:
enable collapse pmd for pte-mapped THP"), which actually omitted flushes
for the removal of present PTEs, not just for the removal of empty page
tables.

Link: https://lkml.kernel.org/r/20221129154730.2274278-3-jannh@google.com
Link: https://lkml.kernel.org/r/20221128180252.1684965-3-jannh@google.com
Link: https://lkml.kernel.org/r/20221125213714.4115729-3-jannh@google.com
Fixes: f3f0e1d2150b ("khugepaged: add support of collapse for tmpfs/shmem pages")
Signed-off-by: Jann Horn <jannh@google.com>
Acked-by: David Hildenbrand <david@redhat.com>
Reviewed-by: Yang Shi <shy828301@gmail.com>
Cc: John Hubbard <jhubbard@nvidia.com>
Cc: Peter Xu <peterx@redhat.com>
Cc: <stable@vger.kernel.org>
Signed-off-by: Andrew Morton <akpm@linux-foundation.org>
[manual backport: this code was refactored from two copies into a common
helper between 5.15 and 6.0;
pmd collapse for PTE-mapped THP was only added in 5.4;
MMU notifier API changed between 4.19 and 5.4]
Signed-off-by: Jann Horn <jannh@google.com>
Signed-off-by: Greg Kroah-Hartman <gregkh@linuxfoundation.org>
mm/khugepaged.c

index 6f8a1b423538a156db6a9960f96896c2e94a1be3..644f0a9c8a55dc76b1b36e95fa7f85fe059e87db 100644
@@ -1304,13 +1304,20 @@ static void retract_page_tables(struct address_space *mapping, pgoff_t pgoff)
                 */
                if (down_write_trylock(&mm->mmap_sem)) {
                        if (!khugepaged_test_exit(mm)) {
-                               spinlock_t *ptl = pmd_lock(mm, pmd);
+                               spinlock_t *ptl;
+                               unsigned long end = addr + HPAGE_PMD_SIZE;
+
+                               mmu_notifier_invalidate_range_start(mm, addr,
+                                                                   end);
+                               ptl = pmd_lock(mm, pmd);
                                /* assume page table is clear */
                                _pmd = pmdp_collapse_flush(vma, addr, pmd);
                                spin_unlock(ptl);
                                atomic_long_dec(&mm->nr_ptes);
                                tlb_remove_table_sync_one();
                                pte_free(mm, pmd_pgtable(_pmd));
+                               mmu_notifier_invalidate_range_end(mm, addr,
+                                                                 end);
                        }
                        up_write(&mm->mmap_sem);
                }
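
The ordering this hunk establishes (notify secondary MMUs, clear the PMD, free the page table, end the invalidation) can be illustrated with a small userspace model. This is only a sketch under stated assumptions: every `model_*` name, the struct layouts, and the 2 MiB range size are hypothetical stand-ins invented for illustration, not kernel APIs, and locking is left out entirely.

```c
/* Userspace model only: none of these types or functions are kernel APIs. */
#include <assert.h>
#include <stdbool.h>
#include <stddef.h>

#define MODEL_HPAGE_PMD_SIZE (2UL << 20) /* assumption: 2 MiB PMD range */

/* Hypothetical secondary MMU mirroring one address range (KVM-like). */
struct model_secondary_mmu {
	unsigned long start, end;   /* mirrored range [start, end) */
	bool mirrored;              /* true while it may still touch the pages */
	int invalidations_open;     /* open start/end pairs */
};

/* Hypothetical primary address space with a single PMD entry. */
struct model_mm {
	bool pmd_present;
	bool table_freed;
	struct model_secondary_mmu *notifier; /* may be NULL */
};

/* Stand-in for mmu_notifier_invalidate_range_start(). */
static void model_invalidate_range_start(struct model_mm *mm,
					 unsigned long start,
					 unsigned long end)
{
	struct model_secondary_mmu *s = mm->notifier;

	if (!s)
		return;
	s->invalidations_open++;
	/* Drop the mirror if the invalidated range overlaps it. */
	if (start < s->end && s->start < end)
		s->mirrored = false;
}

/* Stand-in for mmu_notifier_invalidate_range_end(). */
static void model_invalidate_range_end(struct model_mm *mm,
				       unsigned long start,
				       unsigned long end)
{
	(void)start;
	(void)end;
	if (!mm->notifier)
		return;
	assert(mm->notifier->invalidations_open > 0);
	mm->notifier->invalidations_open--;
}

/*
 * Retract one page table. Returns true if the secondary MMU still
 * mirrors the range after the table was freed, i.e. the stale-access
 * situation the commit message describes.
 */
static bool model_retract(struct model_mm *mm, unsigned long addr,
			  bool notify)
{
	unsigned long end = addr + MODEL_HPAGE_PMD_SIZE;

	if (notify)
		model_invalidate_range_start(mm, addr, end);
	mm->pmd_present = false;  /* stands in for pmdp_collapse_flush() */
	mm->table_freed = true;   /* stands in for pte_free() */
	if (notify)
		model_invalidate_range_end(mm, addr, end);

	return mm->table_freed && mm->notifier && mm->notifier->mirrored;
}
```

Calling `model_retract()` with `notify` set to false models the pre-patch behaviour: the mirror stays live after the table is freed and the function reports a stale mapping. With `notify` set to true, the mirror is dropped before the PMD is cleared, which is the property the patch restores.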