linux/arch
Hugh Dickins e0da382c92 [PATCH] freepgt: free_pgtables use vma list
Recent woes with some arches needing their own pgd_addr_end macro; and 4-level
clear_page_range regression since 2.6.10's clear_page_tables; and its
long-standing well-known inefficiency in searching throughout the higher-level
page tables for those few entries to clear and free: all can be blamed on
ignoring the list of vmas when we free page tables.

Replace exit_mmap's clear_page_range of the total user address space by
free_pgtables operating on the mm's vma list; unmap_region use it in the same
way, giving floor and ceiling beyond which it may not free tables.  This
brings lmbench fork/exec/sh numbers back to 2.6.10 (unless preempt is enabled,
in which case latency fixes spoil unmap_vmas throughput).

Beware: the do_mmap_pgoff driver failure case must now use unmap_region
instead of zap_page_range, since a page table might have been allocated, and
can only be freed while it is touched by some vma.

Move free_pgtables from mmap.c to memory.c, where its lower levels are adapted
from the clear_page_range levels.  (Most of free_pgtables' old code was
actually for a non-existent case, prev not properly set up, dating from before
hch gave us split_vma.) Pass mmu_gather** in the public interfaces, since we
might want to add latency lockdrops later; but no attempt to do so yet, going
by vma should itself reduce latency.

But what if is_hugepage_only_range?  Those ia64 and ppc64 cases need careful
examination: put that off until a later patch of the series.

What of x86_64's 32bit vdso page __map_syscall32 maps outside any vma?

And the range to sparc64's flush_tlb_pgtables?  It's less clear to me now that
we need to do more than is done here - every PMD_SIZE ever occupied will be
flushed, do we really have to flush every PGDIR_SIZE ever partially occupied? 
A shame to complicate it unnecessarily.

Special thanks to David Miller for time spent repairing my ceilings.

Signed-off-by: Hugh Dickins <hugh@veritas.com>
Signed-off-by: Andrew Morton <akpm@osdl.org>
Signed-off-by: Linus Torvalds <torvalds@osdl.org>
2005-04-19 13:29:15 -07:00
..
alpha Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
arm [PATCH] ARM: Add missing new file for bitops patch 2005-04-18 22:50:01 +01:00
arm26 Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
cris Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
frv Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
h8300 Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
i386 [PATCH] freepgt: free_pgtables use vma list 2005-04-19 13:29:15 -07:00
ia64 [PATCH] freepgt: free_pgtables use vma list 2005-04-19 13:29:15 -07:00
m32r Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
m68k [PATCH] M68k: Update defconfigs for 2.6.12-rc2 2005-04-18 10:47:34 -07:00
m68knommu Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
mips [PATCH] mips: remove #include <linux/audit.h> two times 2005-04-16 15:24:41 -07:00
parisc [PATCH] add Big Endian variants of ioread/iowrite 2005-04-16 15:25:54 -07:00
ppc [PATCH] ppc64: very basic desktop g5 sound support 2005-04-16 15:24:32 -07:00
ppc64 [PATCH] u32 vs. pm_message_t in ppc and radeon 2005-04-16 15:25:34 -07:00
s390 Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
sh [PATCH] fix u32 vs. pm_message_t in rest of the tree 2005-04-16 15:25:37 -07:00
sh64 Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
sparc [PATCH] sparc: Fix PTRACE_CONT bogosity 2005-04-17 18:03:11 -07:00
sparc64 [PATCH] sparc64: Fix stat 2005-04-18 15:13:15 -07:00
um [PATCH] uml: fix compilation for __CHOOSE_MODE addition 2005-04-16 15:25:39 -07:00
v850 Linux-2.6.12-rc2 2005-04-16 15:20:36 -07:00
x86_64 [PATCH] Fix u32 vs. pm_message_t in x86-64 2005-04-16 15:25:31 -07:00