[ltt-dev] Userspace RCU git fixes

Mathieu Desnoyers mathieu.desnoyers at efficios.com
Tue Sep 13 13:53:37 EDT 2011


* Paolo Bonzini (pbonzini at redhat.com) wrote:
> > Just to let you know that I pushed two updates into urcu: one fixes a
> > grace period hang caused by a missing wakeup in the synchronize_rcu QSBR
> > code. This appears to hit us due to the more fine-grained wakeup
> > code brought by Paolo. The wakeup was really missing from the
> > synchronize_rcu code (so Paolo's code just triggered an existing
> > problem). I thought it would be good to let you know the effect: grace
> > periods are delayed forever. This problem never appeared in a release (I
> > caught it before).
> 
> Good catch.  Why not use rcu_thread_offline/online in synchronize_rcu,
> instead of touching rcu_reader.ctr directly?  I had this in my QEMU
> branch but hadn't posted yet because it was meant as a cleanup only.

Yep. I think in the initial QSBR versions, we could remove one barrier
by inlining this, but I think I recall it was only a compiler barrier,
so I agree that it's better to make the code easier to manage than save
this possibly effect-less barrier (it is immediately followed by a mutex
lock and preceded by a mutex unlock).

Merged,

Thanks!

Mathieu

> 
> Paolo
> 
> ----------------------- 8< ---------------------
> 
> From 7ad6897f696034ef0651c912e43931a2b0bbe631 Mon Sep 17 00:00:00 2001
> From: Paolo Bonzini <pbonzini at redhat.com>
> Date: Mon, 12 Sep 2011 09:24:18 +0200
> Subject: [PATCH] urcu-qsbr: use rcu_thread_offline/rcu_thread_online instead of inlining them
> 
> ---
>  urcu-qsbr.c |   40 +++++++++++++++++-----------------------
>  1 files changed, 17 insertions(+), 23 deletions(-)
> 
> diff --git a/urcu-qsbr.c b/urcu-qsbr.c
> index 1dc9979..1adaa94 100644
> --- a/urcu-qsbr.c
> +++ b/urcu-qsbr.c
> @@ -208,21 +208,17 @@ void synchronize_rcu(void)
>  	was_online = rcu_reader.ctr;
>  
>  	/* All threads should read qparity before accessing data structure
> -	 * where new ptr points to.
> -	 */
> -	/* Write new ptr before changing the qparity */
> -	cmm_smp_mb();
> -
> -	/*
> +	 * where new ptr points to.  In the "then" case, rcu_thread_offline
> +	 * includes a memory barrier.
> +	 *
>  	 * Mark the writer thread offline to make sure we don't wait for
>  	 * our own quiescent state. This allows using synchronize_rcu()
>  	 * in threads registered as readers.
>  	 */
> -	if (was_online) {
> -		CMM_STORE_SHARED(rcu_reader.ctr, 0);
> -		cmm_smp_mb();	/* write rcu_reader.ctr before read futex */
> -		wake_up_gp();
> -	}
> +	if (was_online)
> +		rcu_thread_offline();
> +	else
> +		cmm_smp_mb();
>  
>  	mutex_lock(&rcu_gp_lock);
>  
> @@ -263,9 +259,9 @@ out:
>  	 * freed.
>  	 */
>  	if (was_online)
> -		_CMM_STORE_SHARED(rcu_reader.ctr,
> -				  CMM_LOAD_SHARED(rcu_gp_ctr));
> -	cmm_smp_mb();
> +		rcu_thread_online();
> +	else
> +		cmm_smp_mb();
>  }
>  #else /* !(CAA_BITS_PER_LONG < 64) */
>  void synchronize_rcu(void)
> @@ -279,12 +275,10 @@ void synchronize_rcu(void)
>  	 * our own quiescent state. This allows using synchronize_rcu()
>  	 * in threads registered as readers.
>  	 */
> -	cmm_smp_mb();
> -	if (was_online) {
> -		CMM_STORE_SHARED(rcu_reader.ctr, 0);
> -		cmm_smp_mb();	/* write rcu_reader.ctr before read futex */
> -		wake_up_gp();
> -	}
> +	if (was_online)
> +		rcu_thread_offline();
> +	else
> +		cmm_smp_mb();
>  
>  	mutex_lock(&rcu_gp_lock);
>  	if (cds_list_empty(&registry))
> @@ -294,9 +288,9 @@ out:
>  	mutex_unlock(&rcu_gp_lock);
>  
>  	if (was_online)
> -		_CMM_STORE_SHARED(rcu_reader.ctr,
> -				  CMM_LOAD_SHARED(rcu_gp_ctr));
> -	cmm_smp_mb();
> +		rcu_thread_online();
> +	else
> +		cmm_smp_mb();
>  }
>  #endif  /* !(CAA_BITS_PER_LONG < 64) */
>  
> -- 
> 1.7.6
> 

-- 
Mathieu Desnoyers
Operating System Efficiency R&D Consultant
EfficiOS Inc.
http://www.efficios.com




More information about the lttng-dev mailing list