diff options
author | Steven Rostedt <srostedt@redhat.com> | 2009-03-27 11:00:29 -0400 |
---|---|---|
committer | Steven Rostedt <rostedt@goodmis.org> | 2009-07-07 18:36:12 -0400 |
commit | 77ae365eca895061c8bf2b2e3ae1d9ea62869739 (patch) | |
tree | fc808698a4f2869b45ef4e8c958cbab183a7ad37 /kernel/trace/trace.c | |
parent | 3adc54fa82a68be1cd1ac82ad786ee362796e50a (diff) |
ring-buffer: make lockless
This patch converts the ring buffers into a completely lockless
buffer recording system. The read side still takes locks since
we still serialize readers. But the writers are the ones that
must be lockless (those can happen in NMIs).
The main change is to the "head_page" pointer. We write to the
tail, and read from the head. The "head_page" pointer in the cpu
buffer is now just a reference to where to look. The real head
page is now kept in the head_page->list->prev->next pointer.
That is, in the list head of the previous page we set flags.
The list pages are allocated to be aligned such that the lowest
significant bits are always zero pointing to the list. This gives
us play to put in flags to their pointers.
bit 0: set when the page is a head page
bit 1: set when the writer is moving the page (for overwrite mode)
cmpxchg is used to update the pointer.
When the writer wraps the buffer and the tail meets the head,
in overwrite mode, the writer must move the head page forward.
It first uses cmpxchg to change the pointer flag from 1 to 2.
Once this is done, the reader on another CPU will not take the
page from the buffer.
The writers need to protect against interrupts (we don't bother with
disabling interrupts because NMIs are allowed to write too).
After the writer sets the pointer flag to 2, it takes care to
manage interrupts coming in. This is discribed in detail within the
comments of the code.
Changes in version 2:
- Let reader reset entries value of header page.
- Fix tail page passing commit page on reader page test.
- Always increment entries and write counter in rb_tail_page_update
- Add safety check in rb_set_commit_to_write to break out of infinite loop
- add mask in rb_is_reader_page
[ Impact: lock free writing to the ring buffer ]
Signed-off-by: Steven Rostedt <srostedt@redhat.com>
Diffstat (limited to 'kernel/trace/trace.c')
-rw-r--r-- | kernel/trace/trace.c | 3 |
1 files changed, 0 insertions, 3 deletions
diff --git a/kernel/trace/trace.c b/kernel/trace/trace.c index bdb3afc8b306..b591f7a1bd7b 100644 --- a/kernel/trace/trace.c +++ b/kernel/trace/trace.c | |||
@@ -3630,9 +3630,6 @@ tracing_stats_read(struct file *filp, char __user *ubuf, | |||
3630 | cnt = ring_buffer_commit_overrun_cpu(tr->buffer, cpu); | 3630 | cnt = ring_buffer_commit_overrun_cpu(tr->buffer, cpu); |
3631 | trace_seq_printf(s, "commit overrun: %ld\n", cnt); | 3631 | trace_seq_printf(s, "commit overrun: %ld\n", cnt); |
3632 | 3632 | ||
3633 | cnt = ring_buffer_nmi_dropped_cpu(tr->buffer, cpu); | ||
3634 | trace_seq_printf(s, "nmi dropped: %ld\n", cnt); | ||
3635 | |||
3636 | count = simple_read_from_buffer(ubuf, count, ppos, s->buffer, s->len); | 3633 | count = simple_read_from_buffer(ubuf, count, ppos, s->buffer, s->len); |
3637 | 3634 | ||
3638 | kfree(s); | 3635 | kfree(s); |