mm, sched: Drop voluntary schedule from might_fault()
authorMichael S. Tsirkin <mst@redhat.com>
Sun, 26 May 2013 14:32:13 +0000 (17:32 +0300)
committerIngo Molnar <mingo@kernel.org>
Tue, 28 May 2013 07:41:11 +0000 (09:41 +0200)
might_fault() is called from functions like copy_to_user()
which most callers expect to be very fast, like a couple of
instructions.

So functions like memcpy_toiovec() call them many times in a loop.

But might_fault() calls might_sleep() and with CONFIG_PREEMPT_VOLUNTARY
this results in a function call.

Let's not do this - just call __might_sleep() that produces
a diagnostic for sleep within atomic, but drop
might_preempt().

Here's a test sending traffic between the VM and the host,
host is built with CONFIG_PREEMPT_VOLUNTARY:

 before:
incoming: 7122.77   Mb/s
outgoing: 8480.37   Mb/s

 after:
incoming: 8619.24   Mb/s
outgoing: 9455.42   Mb/s

As a side effect, this fixes an issue pointed
out by Ingo: might_fault might schedule differently
depending on PROVE_LOCKING. Now there's no
preemption point in both cases, so it's consistent.

Signed-off-by: Michael S. Tsirkin <mst@redhat.com>
Signed-off-by: Peter Zijlstra <peterz@infradead.org>
Cc: Linus Torvalds <torvalds@linux-foundation.org>
Cc: Andrew Morton <akpm@linux-foundation.org>
Cc: Peter Zijlstra <a.p.zijlstra@chello.nl>
Link: http://lkml.kernel.org/r/1369577426-26721-10-git-send-email-mst@redhat.com
Signed-off-by: Ingo Molnar <mingo@kernel.org>
include/linux/kernel.h
mm/memory.c

index e9ef6d6b51d5b07f6471e96dc20efa5685416220..24719eaa1209f5c68dda24bbbf0ec46a8e234598 100644 (file)
@@ -198,7 +198,7 @@ void might_fault(void);
 #else
 static inline void might_fault(void)
 {
-       might_sleep();
+       __might_sleep(__FILE__, __LINE__, 0);
 }
 #endif
 
index 6dc1882fbd725c61badb4fd072418c7330001ce1..c1f190f51f6f2d2ff62a4851e2fd37cbdcaacdc7 100644 (file)
@@ -4222,7 +4222,8 @@ void might_fault(void)
        if (segment_eq(get_fs(), KERNEL_DS))
                return;
 
-       might_sleep();
+       __might_sleep(__FILE__, __LINE__, 0);
+
        /*
         * it would be nicer only to annotate paths which are not under
         * pagefault_disable, however that requires a larger audit and