[ltt-dev] [UST PATCH 2/3] ust-consumerd: fix exit race log corruption

Mathieu Desnoyers mathieu.desnoyers at efficios.com
Thu Apr 28 10:57:22 EDT 2011


* Nils Carlson (nils.carlson at ericsson.com) wrote:
> merged.

This is a step in the right direction, but we should not stop there: I
recommend to:

1 - create a linked list of active threads. list_add should be called
    after each pthread_create.
2 - remove the call to pthread_detach().
3 - before exit, iterate on each thread of the list, calling
    pthread_join for each of them.
4 - then we can remove the counter-based scheme proposed by this patch.

Because with this scheme, we still race between exit() and thread memory
free/teardown. It won't cause much problem in practice, but valgrind is
likely to complain.

Thanks,

Mathieu

>
> On Wed, 27 Apr 2011, Jason Wessel wrote:
>
>> In the following scenario on an SMP system the ust-consumerd can end
>> up not properly closing out file handles which leads to log
>> corruption:
>>  * usttrace -m -l small_quick_app_lots_of_malloc_and_free
>>  * The app completes and usttrace sees and sends the SIGTERM to ust-consumerd
>>  * The ust-consumerd main thread will exit and the _exit() handlers
>>    kills off the remaining pthreads without everything getting closed out
>>
>> The solution to the problem is to introduce an active_thread count for
>> the private ustconsumer_instance.  This counter will be zeroed out
>> when it is safe to completely shutdown the main thread, which will
>> subsequently run the _exit() handlers.
>>
>> Signed-off-by: Jason Wessel <jason.wessel at windriver.com>
>> ---
>> include/ust/ustconsumer.h       |    1 +
>> libustconsumer/libustconsumer.c |   10 +++++++++-
>> 2 files changed, 10 insertions(+), 1 deletions(-)
>>
>> diff --git a/include/ust/ustconsumer.h b/include/ust/ustconsumer.h
>> index f99f8f1..cde8440 100644
>> --- a/include/ust/ustconsumer.h
>> +++ b/include/ust/ustconsumer.h
>> @@ -89,6 +89,7 @@ struct ustconsumer_instance {
>> 	char *sock_path;
>> 	pthread_mutex_t mutex;
>> 	int active_buffers;
>> +	int active_threads;
>> };
>>
>> /**
>> diff --git a/libustconsumer/libustconsumer.c b/libustconsumer/libustconsumer.c
>> index 6f6d4bb..a723540 100644
>> --- a/libustconsumer/libustconsumer.c
>> +++ b/libustconsumer/libustconsumer.c
>> @@ -543,6 +543,10 @@ void *consumer_thread(void *arg)
>> 	int result;
>> 	sigset_t sigset;
>>
>> +	pthread_mutex_lock(&args->instance->mutex);
>> +	args->instance->active_threads++;
>> +	pthread_mutex_unlock(&args->instance->mutex);
>> +
>> 	if(args->instance->callbacks->on_new_thread)
>> 		args->instance->callbacks->on_new_thread(args->instance->callbacks);
>>
>> @@ -584,6 +588,10 @@ void *consumer_thread(void *arg)
>> 	if(args->instance->callbacks->on_close_thread)
>> 		args->instance->callbacks->on_close_thread(args->instance->callbacks);
>>
>> +	pthread_mutex_lock(&args->instance->mutex);
>> +	args->instance->active_threads--;
>> +	pthread_mutex_unlock(&args->instance->mutex);
>> +
>> 	free((void *)args->channel);
>> 	free(args);
>> 	return NULL;
>> @@ -735,7 +743,7 @@ int ustconsumer_start_instance(struct ustconsumer_instance *instance)
>>
>> 		if (instance->quit_program) {
>> 			pthread_mutex_lock(&instance->mutex);
>> -			if(instance->active_buffers == 0) {
>> +			if(instance->active_buffers == 0 && instance->active_threads == 0) {
>> 				pthread_mutex_unlock(&instance->mutex);
>> 				break;
>> 			}
>> -- 
>> 1.7.1
>>
>>
>> _______________________________________________
>> ltt-dev mailing list
>> ltt-dev at lists.casi.polymtl.ca
>> http://lists.casi.polymtl.ca/cgi-bin/mailman/listinfo/ltt-dev
>>

-- 
Mathieu Desnoyers
Operating System Efficiency R&D Consultant
EfficiOS Inc.
http://www.efficios.com




More information about the lttng-dev mailing list