time, signal: Protect resource use statistics with seqlock

Both times() and clock_gettime(CLOCK_PROCESS_CPUTIME_ID) have scalability issues on large systems, due to both functions being serialized with a lock. The lock protects against reporting a wrong value, due to a thread in the task group exiting, its statistics reporting up to the signal struct, and that exited task's statistics being counted twice (or not at all). Protecting that with a lock results in times() and clock_gettime() being completely serialized on large systems. This can be fixed by using a seqlock around the events that gather and propagate statistics. As an additional benefit, the protection code can be moved into thread_group_cputime(), slightly simplifying the calling functions. In the case of posix_cpu_clock_get_task() things can be simplified a lot, because the calling function already ensures that the task sticks around, and the rest is now taken care of in thread_group_cputime(). This way the statistics reporting code can run lockless. Signed-off-by: Rik van Riel <riel@redhat.com> Signed-off-by: Peter Zijlstra (Intel) <peterz@infradead.org> Cc: Alex Thorlton <athorlton@sgi.com> Cc: Andrew Morton <akpm@linux-foundation.org> Cc: Daeseok Youn <daeseok.youn@gmail.com> Cc: David Rientjes <rientjes@google.com> Cc: Dongsheng Yang <yangds.fnst@cn.fujitsu.com> Cc: Geert Uytterhoeven <geert@linux-m68k.org> Cc: Guillaume Morin <guillaume@morinfr.org> Cc: Ionut Alexa <ionut.m.alexa@gmail.com> Cc: Kees Cook <keescook@chromium.org> Cc: Linus Torvalds <torvalds@linux-foundation.org> Cc: Li Zefan <lizefan@huawei.com> Cc: Michal Hocko <mhocko@suse.cz> Cc: Michal Schmidt <mschmidt@redhat.com> Cc: Oleg Nesterov <oleg@redhat.com> Cc: Vladimir Davydov <vdavydov@parallels.com> Cc: umgwanakikbuti@gmail.com Cc: fweisbec@gmail.com Cc: srao@redhat.com Cc: lwoodman@redhat.com Cc: atheurer@redhat.com Link: http://lkml.kernel.org/r/20140816134010.26a9b572@annuminas.surriel.com Signed-off-by: Ingo Molnar <mingo@kernel.org>
author: Rik van Riel <riel@redhat.com> 2014-08-16 13:40:10 -0400
committer: Ingo Molnar <mingo@kernel.org> 2014-09-08 02:17:01 -0400
commit: e78c3496790ee8a36522a838b59b388e8a709e65 (patch)
tree: 0473b9ea676754d50b19eb1a862ac16fdffacbeb /kernel/time
parent: 90ed9cbe765ad358b3151a12b8bf889a3cbcd573 (diff)
1 files changed, 0 insertions, 14 deletions
diff --git a/kernel/time/posix-cpu-timers.c b/kernel/time/posix-cpu-timers.c
index 3b8946416a5f..492b986195d5 100644
--- a/kernel/time/posix-cpu-timers.c
+++ b/kernel/time/posix-cpu-timers.c
@@ -272,22 +272,8 @@ static int posix_cpu_clock_get_task(struct task_struct *tsk,
                if (same_thread_group(tsk, current))
                        err = cpu_clock_sample(which_clock, tsk, &rtn);
        } else {
-                unsigned long flags;
-                struct sighand_struct *sighand;
-                /*
-                 * while_each_thread() is not yet entirely RCU safe,
-                 * keep locking the group while sampling process
-                 * clock for now.
-                 */
-                sighand = lock_task_sighand(tsk, &flags);
-                if (!sighand)
-                        return err;
                if (tsk == current || thread_group_leader(tsk))
                        err = cpu_clock_sample_group(which_clock, tsk, &rtn);
-                unlock_task_sighand(tsk, &flags);
        }
        if (!err)
author	Rik van Riel <riel@redhat.com>	2014-08-16 13:40:10 -0400
committer	Ingo Molnar <mingo@kernel.org>	2014-09-08 02:17:01 -0400
commit	e78c3496790ee8a36522a838b59b388e8a709e65 (patch)
tree	0473b9ea676754d50b19eb1a862ac16fdffacbeb /kernel/time
parent	90ed9cbe765ad358b3151a12b8bf889a3cbcd573 (diff)

diff --git a/kernel/time/posix-cpu-timers.c b/kernel/time/posix-cpu-timers.c index 3b8946416a5f..492b986195d5 100644 --- a/kernel/time/posix-cpu-timers.c +++ b/kernel/time/posix-cpu-timers.c
@@ -272,22 +272,8 @@ static int posix_cpu_clock_get_task(struct task_struct *tsk,
272	if (same_thread_group(tsk, current))	272	if (same_thread_group(tsk, current))
273	err = cpu_clock_sample(which_clock, tsk, &rtn);	273	err = cpu_clock_sample(which_clock, tsk, &rtn);
274	} else {	274	} else {
275	unsigned long flags;
276	struct sighand_struct *sighand;
277
278	/*
279	* while_each_thread() is not yet entirely RCU safe,
280	* keep locking the group while sampling process
281	* clock for now.
282	*/
283	sighand = lock_task_sighand(tsk, &flags);
284	if (!sighand)
285	return err;
286
287	if (tsk == current \|\| thread_group_leader(tsk))	275	if (tsk == current \|\| thread_group_leader(tsk))
288	err = cpu_clock_sample_group(which_clock, tsk, &rtn);	276	err = cpu_clock_sample_group(which_clock, tsk, &rtn);
289
290	unlock_task_sighand(tsk, &flags);
291	}	277	}
292		278
293	if (!err)	279	if (!err)