Skip to content

Commit

Permalink
tracing: Have format file honor EVENT_FILE_FL_FREED
Browse files Browse the repository at this point in the history
commit b156040 upstream.

When eventfs was introduced, special care had to be done to coordinate the
freeing of the file meta data with the files that are exposed to user
space. The file meta data would have a ref count that is set when the file
is created and would be decremented and freed after the last user that
opened the file closed it. When the file meta data was to be freed, it
would set a flag (EVENT_FILE_FL_FREED) to denote that the file is freed,
and any new references made (like new opens or reads) would fail as it is
marked freed. This allowed other meta data to be freed after this flag was
set (under the event_mutex).

All the files that were dynamically created in the events directory had a
pointer to the file meta data and would call event_release() when the last
reference to the user space file was closed. This would be the time that it
is safe to free the file meta data.

A shortcut was made for the "format" file. It's i_private would point to
the "call" entry directly and not point to the file's meta data. This is
because all format files are the same for the same "call", so it was
thought there was no reason to differentiate them.  The other files
maintain state (like the "enable", "trigger", etc). But this meant if the
file were to disappear, the "format" file would be unaware of it.

This caused a race that could be trigger via the user_events test (that
would create dynamic events and free them), and running a loop that would
read the user_events format files:

In one console run:

 # cd tools/testing/selftests/user_events
 # while true; do ./ftrace_test; done

And in another console run:

 # cd /sys/kernel/tracing/
 # while true; do cat events/user_events/__test_event/format; done 2>/dev/null

With KASAN memory checking, it would trigger a use-after-free bug report
(which was a real bug). This was because the format file was not checking
the file's meta data flag "EVENT_FILE_FL_FREED", so it would access the
event that the file meta data pointed to after the event was freed.

After inspection, there are other locations that were found to not check
the EVENT_FILE_FL_FREED flag when accessing the trace_event_file. Add a
new helper function: event_file_file() that will make sure that the
event_mutex is held, and will return NULL if the trace_event_file has the
EVENT_FILE_FL_FREED flag set. Have the first reference of the struct file
pointer use event_file_file() and check for NULL. Later uses can still use
the event_file_data() helper function if the event_mutex is still held and
was not released since the event_file_file() call.

Link: https://lore.kernel.org/all/[email protected]/

Cc: [email protected]
Cc: Masami Hiramatsu <[email protected]>
Cc: Mathieu Desnoyers   <[email protected]>
Cc: Ajay Kaher <[email protected]>
Cc: Ilkka Naulapää    <[email protected]>
Cc: Linus Torvalds <[email protected]>
Cc: Al   Viro <[email protected]>
Cc: Dan Carpenter   <[email protected]>
Cc: Beau Belgrave <[email protected]>
Cc: Florian Fainelli  <[email protected]>
Cc: Alexey Makhalov    <[email protected]>
Cc: Vasavi Sirnapalli    <[email protected]>
Link: https://lore.kernel.org/[email protected]
Fixes: b63db58 ("eventfs/tracing: Add callback for release of an eventfs_inode")
Reported-by: Mathias Krause <[email protected]>
Tested-by: Mathias Krause <[email protected]>
Signed-off-by: Steven Rostedt (Google) <[email protected]>
[Resolve conflict due to lack of commit a1f157c ("tracing: Expand all
 ring buffers individually") which add tracing_update_buffers() in
event_enable_write(), that commit is more of a feature than a bugfix
and is not related to the problem fixed by this patch]
Signed-off-by: Zheng Yejian <[email protected]>
Signed-off-by: Greg Kroah-Hartman <[email protected]>
  • Loading branch information
rostedt authored and gregkh committed Sep 4, 2024
1 parent 9a9716b commit 4ed0375
Show file tree
Hide file tree
Showing 5 changed files with 49 additions and 19 deletions.
23 changes: 23 additions & 0 deletions kernel/trace/trace.h
Original file line number Diff line number Diff line change
Expand Up @@ -1555,6 +1555,29 @@ static inline void *event_file_data(struct file *filp)
extern struct mutex event_mutex;
extern struct list_head ftrace_events;

/*
* When the trace_event_file is the filp->i_private pointer,
* it must be taken under the event_mutex lock, and then checked
* if the EVENT_FILE_FL_FREED flag is set. If it is, then the
* data pointed to by the trace_event_file can not be trusted.
*
* Use the event_file_file() to access the trace_event_file from
* the filp the first time under the event_mutex and check for
* NULL. If it is needed to be retrieved again and the event_mutex
* is still held, then the event_file_data() can be used and it
* is guaranteed to be valid.
*/
static inline struct trace_event_file *event_file_file(struct file *filp)
{
struct trace_event_file *file;

lockdep_assert_held(&event_mutex);
file = READ_ONCE(file_inode(filp)->i_private);
if (!file || file->flags & EVENT_FILE_FL_FREED)
return NULL;
return file;
}

extern const struct file_operations event_trigger_fops;
extern const struct file_operations event_hist_fops;
extern const struct file_operations event_hist_debug_fops;
Expand Down
33 changes: 20 additions & 13 deletions kernel/trace/trace_events.c
Original file line number Diff line number Diff line change
Expand Up @@ -1386,12 +1386,12 @@ event_enable_read(struct file *filp, char __user *ubuf, size_t cnt,
char buf[4] = "0";

mutex_lock(&event_mutex);
file = event_file_data(filp);
file = event_file_file(filp);
if (likely(file))
flags = file->flags;
mutex_unlock(&event_mutex);

if (!file || flags & EVENT_FILE_FL_FREED)
if (!file)
return -ENODEV;

if (flags & EVENT_FILE_FL_ENABLED &&
Expand Down Expand Up @@ -1428,8 +1428,8 @@ event_enable_write(struct file *filp, const char __user *ubuf, size_t cnt,
case 1:
ret = -ENODEV;
mutex_lock(&event_mutex);
file = event_file_data(filp);
if (likely(file && !(file->flags & EVENT_FILE_FL_FREED)))
file = event_file_file(filp);
if (likely(file))
ret = ftrace_event_enable_disable(file, val);
mutex_unlock(&event_mutex);
break;
Expand Down Expand Up @@ -1538,7 +1538,8 @@ enum {

static void *f_next(struct seq_file *m, void *v, loff_t *pos)
{
struct trace_event_call *call = event_file_data(m->private);
struct trace_event_file *file = event_file_data(m->private);
struct trace_event_call *call = file->event_call;
struct list_head *common_head = &ftrace_common_fields;
struct list_head *head = trace_get_fields(call);
struct list_head *node = v;
Expand Down Expand Up @@ -1570,7 +1571,8 @@ static void *f_next(struct seq_file *m, void *v, loff_t *pos)

static int f_show(struct seq_file *m, void *v)
{
struct trace_event_call *call = event_file_data(m->private);
struct trace_event_file *file = event_file_data(m->private);
struct trace_event_call *call = file->event_call;
struct ftrace_event_field *field;
const char *array_descriptor;

Expand Down Expand Up @@ -1625,12 +1627,14 @@ static int f_show(struct seq_file *m, void *v)

static void *f_start(struct seq_file *m, loff_t *pos)
{
struct trace_event_file *file;
void *p = (void *)FORMAT_HEADER;
loff_t l = 0;

/* ->stop() is called even if ->start() fails */
mutex_lock(&event_mutex);
if (!event_file_data(m->private))
file = event_file_file(m->private);
if (!file)
return ERR_PTR(-ENODEV);

while (l < *pos && p)
Expand Down Expand Up @@ -1704,8 +1708,8 @@ event_filter_read(struct file *filp, char __user *ubuf, size_t cnt,
trace_seq_init(s);

mutex_lock(&event_mutex);
file = event_file_data(filp);
if (file && !(file->flags & EVENT_FILE_FL_FREED))
file = event_file_file(filp);
if (file)
print_event_filter(file, s);
mutex_unlock(&event_mutex);

Expand Down Expand Up @@ -1734,9 +1738,13 @@ event_filter_write(struct file *filp, const char __user *ubuf, size_t cnt,
return PTR_ERR(buf);

mutex_lock(&event_mutex);
file = event_file_data(filp);
if (file)
err = apply_event_filter(file, buf);
file = event_file_file(filp);
if (file) {
if (file->flags & EVENT_FILE_FL_FREED)
err = -ENODEV;
else
err = apply_event_filter(file, buf);
}
mutex_unlock(&event_mutex);

kfree(buf);
Expand Down Expand Up @@ -2451,7 +2459,6 @@ static int event_callback(const char *name, umode_t *mode, void **data,
if (strcmp(name, "format") == 0) {
*mode = TRACE_MODE_READ;
*fops = &ftrace_event_format_fops;
*data = call;
return 1;
}

Expand Down
4 changes: 2 additions & 2 deletions kernel/trace/trace_events_hist.c
Original file line number Diff line number Diff line change
Expand Up @@ -5609,7 +5609,7 @@ static int hist_show(struct seq_file *m, void *v)

mutex_lock(&event_mutex);

event_file = event_file_data(m->private);
event_file = event_file_file(m->private);
if (unlikely(!event_file)) {
ret = -ENODEV;
goto out_unlock;
Expand Down Expand Up @@ -5888,7 +5888,7 @@ static int hist_debug_show(struct seq_file *m, void *v)

mutex_lock(&event_mutex);

event_file = event_file_data(m->private);
event_file = event_file_file(m->private);
if (unlikely(!event_file)) {
ret = -ENODEV;
goto out_unlock;
Expand Down
2 changes: 1 addition & 1 deletion kernel/trace/trace_events_inject.c
Original file line number Diff line number Diff line change
Expand Up @@ -299,7 +299,7 @@ event_inject_write(struct file *filp, const char __user *ubuf, size_t cnt,
strim(buf);

mutex_lock(&event_mutex);
file = event_file_data(filp);
file = event_file_file(filp);
if (file) {
call = file->event_call;
size = parse_entry(buf, call, &entry);
Expand Down
6 changes: 3 additions & 3 deletions kernel/trace/trace_events_trigger.c
Original file line number Diff line number Diff line change
Expand Up @@ -159,7 +159,7 @@ static void *trigger_start(struct seq_file *m, loff_t *pos)

/* ->stop() is called even if ->start() fails */
mutex_lock(&event_mutex);
event_file = event_file_data(m->private);
event_file = event_file_file(m->private);
if (unlikely(!event_file))
return ERR_PTR(-ENODEV);

Expand Down Expand Up @@ -213,7 +213,7 @@ static int event_trigger_regex_open(struct inode *inode, struct file *file)

mutex_lock(&event_mutex);

if (unlikely(!event_file_data(file))) {
if (unlikely(!event_file_file(file))) {
mutex_unlock(&event_mutex);
return -ENODEV;
}
Expand Down Expand Up @@ -293,7 +293,7 @@ static ssize_t event_trigger_regex_write(struct file *file,
strim(buf);

mutex_lock(&event_mutex);
event_file = event_file_data(file);
event_file = event_file_file(file);
if (unlikely(!event_file)) {
mutex_unlock(&event_mutex);
kfree(buf);
Expand Down

0 comments on commit 4ed0375

Please sign in to comment.