diff mbox

[v3] semaphore: fix a hangup problem under load on NetBSD hosts.

Message ID 1372841894-10634-1-git-send-email-tsutsui@ceres.dti.ne.jp
State New
Headers show

Commit Message

Izumi Tsutsui July 3, 2013, 8:58 a.m. UTC
Fix following bugs in "fallback implementation of counting semaphores
with mutex+condvar" added in c166cb72f1676855816340666c3b618beef4b976:
 - waiting threads are not restarted properly if more than one threads
   are waiting unblock signals in qemu_sem_timedwait()
 - possible missing pthread_cond_signal(3) calls when waiting threads
   are returned by ETIMEDOUT
 - fix an uninitialized variable
The problem is analyzed by and fix is provided by Noriyuki Soda.

Also put additional cleanup suggested by Laszlo Ersek:
 - make QemuSemaphore.count unsigned (it won't be negative)
 - check a return value of in pthread_cond_wait() in qemu_sem_wait()

Signed-off-by: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
Reviewed-by: Laszlo Ersek <lersek@redhat.com>
---

 v3:
 - fix a missed assignment and actually check a retval of pthread_cond_wait()

 v2:
 - make QemuSemaphore.count unsigned (it won't be negative)
 - also eliminate checks for negative count values
 - check a return value of in pthread_cond_wait() in qemu_sem_wait()

 include/qemu/thread-posix.h |  2 +-
 util/qemu-thread-posix.c    | 28 ++++++++++++++++------------
 2 files changed, 17 insertions(+), 13 deletions(-)

Comments

Laszlo Ersek July 3, 2013, 9:41 a.m. UTC | #1
On 07/03/13 10:58, Izumi Tsutsui wrote:
> Fix following bugs in "fallback implementation of counting semaphores
> with mutex+condvar" added in c166cb72f1676855816340666c3b618beef4b976:
>  - waiting threads are not restarted properly if more than one threads
>    are waiting unblock signals in qemu_sem_timedwait()
>  - possible missing pthread_cond_signal(3) calls when waiting threads
>    are returned by ETIMEDOUT
>  - fix an uninitialized variable
> The problem is analyzed by and fix is provided by Noriyuki Soda.
> 
> Also put additional cleanup suggested by Laszlo Ersek:
>  - make QemuSemaphore.count unsigned (it won't be negative)
>  - check a return value of in pthread_cond_wait() in qemu_sem_wait()
> 
> Signed-off-by: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
> ---
> 
>  v3:
>  - fix a missed assignment and actually check a retval of pthread_cond_wait()

Compared v3 against v2.

Reviewed-by: Laszlo Ersek <lersek@redhat.com>

Laszlo
Brad Smith Aug. 1, 2013, 3:24 a.m. UTC | #2
On 03/07/13 5:41 AM, Laszlo Ersek wrote:
> On 07/03/13 10:58, Izumi Tsutsui wrote:
>> Fix following bugs in "fallback implementation of counting semaphores
>> with mutex+condvar" added in c166cb72f1676855816340666c3b618beef4b976:
>>   - waiting threads are not restarted properly if more than one threads
>>     are waiting unblock signals in qemu_sem_timedwait()
>>   - possible missing pthread_cond_signal(3) calls when waiting threads
>>     are returned by ETIMEDOUT
>>   - fix an uninitialized variable
>> The problem is analyzed by and fix is provided by Noriyuki Soda.
>>
>> Also put additional cleanup suggested by Laszlo Ersek:
>>   - make QemuSemaphore.count unsigned (it won't be negative)
>>   - check a return value of in pthread_cond_wait() in qemu_sem_wait()
>>
>> Signed-off-by: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
>> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>> ---
>>
>>   v3:
>>   - fix a missed assignment and actually check a retval of pthread_cond_wait()
>
> Compared v3 against v2.
>
> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>
> Laszlo

This patch seems to have been dropped.
Paolo Bonzini Aug. 1, 2013, 10:33 p.m. UTC | #3
On 08/01/2013 05:24 AM, Brad wrote:
> On 03/07/13 5:41 AM, Laszlo Ersek wrote:
>> On 07/03/13 10:58, Izumi Tsutsui wrote:
>>> Fix following bugs in "fallback implementation of counting semaphores
>>> with mutex+condvar" added in c166cb72f1676855816340666c3b618beef4b976:
>>>   - waiting threads are not restarted properly if more than one threads
>>>     are waiting unblock signals in qemu_sem_timedwait()
>>>   - possible missing pthread_cond_signal(3) calls when waiting threads
>>>     are returned by ETIMEDOUT
>>>   - fix an uninitialized variable
>>> The problem is analyzed by and fix is provided by Noriyuki Soda.
>>>
>>> Also put additional cleanup suggested by Laszlo Ersek:
>>>   - make QemuSemaphore.count unsigned (it won't be negative)
>>>   - check a return value of in pthread_cond_wait() in qemu_sem_wait()
>>>
>>> Signed-off-by: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
>>> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>>> ---
>>>
>>>   v3:
>>>   - fix a missed assignment and actually check a retval of
>>> pthread_cond_wait()
>>
>> Compared v3 against v2.
>>
>> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>>
>> Laszlo
>
> This patch seems to have been dropped.

CCing Anthony and qemu-stable.

Paolo
Paolo Bonzini Aug. 5, 2013, 4:22 p.m. UTC | #4
On 08/02/2013 12:33 AM, Paolo Bonzini wrote:
> On 08/01/2013 05:24 AM, Brad wrote:
>> On 03/07/13 5:41 AM, Laszlo Ersek wrote:
>>> On 07/03/13 10:58, Izumi Tsutsui wrote:
>>>> Fix following bugs in "fallback implementation of counting semaphores
>>>> with mutex+condvar" added in c166cb72f1676855816340666c3b618beef4b976:
>>>>   - waiting threads are not restarted properly if more than one threads
>>>>     are waiting unblock signals in qemu_sem_timedwait()
>>>>   - possible missing pthread_cond_signal(3) calls when waiting threads
>>>>     are returned by ETIMEDOUT
>>>>   - fix an uninitialized variable
>>>> The problem is analyzed by and fix is provided by Noriyuki Soda.
>>>>
>>>> Also put additional cleanup suggested by Laszlo Ersek:
>>>>   - make QemuSemaphore.count unsigned (it won't be negative)
>>>>   - check a return value of in pthread_cond_wait() in qemu_sem_wait()
>>>>
>>>> Signed-off-by: Izumi Tsutsui <tsutsui@ceres.dti.ne.jp>
>>>> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>>>> ---
>>>>
>>>>   v3:
>>>>   - fix a missed assignment and actually check a retval of
>>>> pthread_cond_wait()
>>>
>>> Compared v3 against v2.
>>>
>>> Reviewed-by: Laszlo Ersek <lersek@redhat.com>
>>>
>>> Laszlo
>>
>> This patch seems to have been dropped.
>
> CCing Anthony and qemu-stable.

And bumping priority further.

Paolo
Anthony Liguori Aug. 14, 2013, 4:29 p.m. UTC | #5
Applied.  Thanks.

Regards,

Anthony Liguori
diff mbox

Patch

diff --git a/include/qemu/thread-posix.h b/include/qemu/thread-posix.h
index 0f30dcc..361566a 100644
--- a/include/qemu/thread-posix.h
+++ b/include/qemu/thread-posix.h
@@ -15,7 +15,7 @@  struct QemuSemaphore {
 #if defined(__APPLE__) || defined(__NetBSD__)
     pthread_mutex_t lock;
     pthread_cond_t cond;
-    int count;
+    unsigned int count;
 #else
     sem_t sem;
 #endif
diff --git a/util/qemu-thread-posix.c b/util/qemu-thread-posix.c
index 4489abf..4de133e 100644
--- a/util/qemu-thread-posix.c
+++ b/util/qemu-thread-posix.c
@@ -170,12 +170,11 @@  void qemu_sem_post(QemuSemaphore *sem)
 
 #if defined(__APPLE__) || defined(__NetBSD__)
     pthread_mutex_lock(&sem->lock);
-    if (sem->count == INT_MAX) {
+    if (sem->count == UINT_MAX) {
         rc = EINVAL;
-    } else if (sem->count++ < 0) {
-        rc = pthread_cond_signal(&sem->cond);
     } else {
-        rc = 0;
+        sem->count++;
+        rc = pthread_cond_signal(&sem->cond);
     }
     pthread_mutex_unlock(&sem->lock);
     if (rc != 0) {
@@ -207,19 +206,21 @@  int qemu_sem_timedwait(QemuSemaphore *sem, int ms)
     struct timespec ts;
 
 #if defined(__APPLE__) || defined(__NetBSD__)
+    rc = 0;
     compute_abs_deadline(&ts, ms);
     pthread_mutex_lock(&sem->lock);
-    --sem->count;
-    while (sem->count < 0) {
+    while (sem->count == 0) {
         rc = pthread_cond_timedwait(&sem->cond, &sem->lock, &ts);
         if (rc == ETIMEDOUT) {
-            ++sem->count;
             break;
         }
         if (rc != 0) {
             error_exit(rc, __func__);
         }
     }
+    if (rc != ETIMEDOUT) {
+        --sem->count;
+    }
     pthread_mutex_unlock(&sem->lock);
     return (rc == ETIMEDOUT ? -1 : 0);
 #else
@@ -249,16 +250,19 @@  int qemu_sem_timedwait(QemuSemaphore *sem, int ms)
 
 void qemu_sem_wait(QemuSemaphore *sem)
 {
+    int rc;
+
 #if defined(__APPLE__) || defined(__NetBSD__)
     pthread_mutex_lock(&sem->lock);
-    --sem->count;
-    while (sem->count < 0) {
-        pthread_cond_wait(&sem->cond, &sem->lock);
+    while (sem->count == 0) {
+        rc = pthread_cond_wait(&sem->cond, &sem->lock);
+        if (rc != 0) {
+            error_exit(rc, __func__);
+        }
     }
+    --sem->count;
     pthread_mutex_unlock(&sem->lock);
 #else
-    int rc;
-
     do {
         rc = sem_wait(&sem->sem);
     } while (rc == -1 && errno == EINTR);