diff mbox

sort_heap complexity guarantee

Message ID 543302DF.7080607@gmail.com
State New
Headers show

Commit Message

François Dumont Oct. 6, 2014, 9 p.m. UTC
On 05/10/2014 22:54, Marc Glisse wrote:
> On Sun, 5 Oct 2014, François Dumont wrote:
>
>>    I took a look at PR 61217 regarding pop_heap complexity guarantee. 
>> Looks like we have no test to check complexity of our algos so I 
>> start writing some starting with the heap operations. I found no 
>> issue with make_heap, push_heap and pop_heap despite what the bug 
>> report is saying however the attached testcase for sort_heap is failing.
>>
>>    Standard is saying std::sort_heap shall use less than N * log(N) 
>> comparisons but with my test using 1000 random values the test is 
>> showing:
>>
>> 8687 comparisons on 6907.76 max allowed
>>
>>    Is this a known issue of sort_heap ? Do you confirm that the test 
>> is valid ?
>
> I would first look for confirmation that the standard didn't just 
> forget a big-O or something. I would expect an implementation as n 
> calls to pop_heap to be legal, and if pop_heap makes 2*log(n) 
> comparisons, that naively sums to too much. And I don't expect the 
> standard to contain an advanced amortized analysis or anything like 
> that...
>
Good point, with n calls to pop_heap it means that limit must be 
2*log(1) + 2*log(2) +... + 2*log(n) which is 2*log(n!) and  which is 
also necessarily < 2*n*log(n). I guess Standard comittee has forgotten 
the factor 2 in the limit so this is what I am using as limit in the 
final test, unless someone prefer the stricter 2*log(n!) ?

Ok to commit those new tests ?

2014-10-06  François Dumont  <fdumont@gcc.gnu.org>

     * testsuite/util/testsuite_counter_type.h
     (counter_type::operator<(const counter_type&)): Update
     less_compare_count when called.
     * testsuite/25_algorithms/make_heap/complexity.cc: New.
     * testsuite/25_algorithms/pop_heap/complexity.cc: New.
     * testsuite/25_algorithms/push_heap/complexity.cc: New.
     * testsuite/25_algorithms/sort_heap/complexity.cc: New.


Tested under Linux x86_64.

François

Comments

Daniel Krügler Oct. 6, 2014, 9:05 p.m. UTC | #1
2014-10-06 23:00 GMT+02:00 François Dumont <frs.dumont@gmail.com>:
> On 05/10/2014 22:54, Marc Glisse wrote:
>>
>> On Sun, 5 Oct 2014, François Dumont wrote:
>>
>>>    I took a look at PR 61217 regarding pop_heap complexity guarantee.
>>> Looks like we have no test to check complexity of our algos so I start
>>> writing some starting with the heap operations. I found no issue with
>>> make_heap, push_heap and pop_heap despite what the bug report is saying
>>> however the attached testcase for sort_heap is failing.
>>>
>>>    Standard is saying std::sort_heap shall use less than N * log(N)
>>> comparisons but with my test using 1000 random values the test is showing:
>>>
>>> 8687 comparisons on 6907.76 max allowed
>>>
>>>    Is this a known issue of sort_heap ? Do you confirm that the test is
>>> valid ?
>>
>> I would first look for confirmation that the standard didn't just forget a
>> big-O or something. I would expect an implementation as n calls to pop_heap
>> to be legal, and if pop_heap makes 2*log(n) comparisons, that naively sums
>> to too much. And I don't expect the standard to contain an advanced
>> amortized analysis or anything like that...
>>
> Good point, with n calls to pop_heap it means that limit must be 2*log(1) +
> 2*log(2) +... + 2*log(n) which is 2*log(n!) and  which is also necessarily <
> 2*n*log(n). I guess Standard comittee has forgotten the factor 2 in the
> limit so this is what I am using as limit in the final test, unless someone
> prefer the stricter 2*log(n!) ?

François, could you please submit a corresponding LWG issue by sending
an email using the recipe described here:

http://www.open-std.org/jtc1/sc22/wg21/docs/lwg-active.html#submit_issue

?

Thanks,

- Daniel
François Dumont Oct. 7, 2014, 9:11 p.m. UTC | #2
On 06/10/2014 23:05, Daniel Krügler wrote:
> 2014-10-06 23:00 GMT+02:00 François Dumont <frs.dumont@gmail.com>:
>> On 05/10/2014 22:54, Marc Glisse wrote:
>>> On Sun, 5 Oct 2014, François Dumont wrote:
>>>
>>>>     I took a look at PR 61217 regarding pop_heap complexity guarantee.
>>>> Looks like we have no test to check complexity of our algos so I start
>>>> writing some starting with the heap operations. I found no issue with
>>>> make_heap, push_heap and pop_heap despite what the bug report is saying
>>>> however the attached testcase for sort_heap is failing.
>>>>
>>>>     Standard is saying std::sort_heap shall use less than N * log(N)
>>>> comparisons but with my test using 1000 random values the test is showing:
>>>>
>>>> 8687 comparisons on 6907.76 max allowed
>>>>
>>>>     Is this a known issue of sort_heap ? Do you confirm that the test is
>>>> valid ?
>>> I would first look for confirmation that the standard didn't just forget a
>>> big-O or something. I would expect an implementation as n calls to pop_heap
>>> to be legal, and if pop_heap makes 2*log(n) comparisons, that naively sums
>>> to too much. And I don't expect the standard to contain an advanced
>>> amortized analysis or anything like that...
>>>
>> Good point, with n calls to pop_heap it means that limit must be 2*log(1) +
>> 2*log(2) +... + 2*log(n) which is 2*log(n!) and  which is also necessarily <
>> 2*n*log(n). I guess Standard comittee has forgotten the factor 2 in the
>> limit so this is what I am using as limit in the final test, unless someone
>> prefer the stricter 2*log(n!) ?
> François, could you please submit a corresponding LWG issue by sending
> an email using the recipe described here:
>
> http://www.open-std.org/jtc1/sc22/wg21/docs/lwg-active.html#submit_issue
>
> ?
>
I just did requesting to use 2N log(N).

And is it ok to commit those ?

François
Daniel Krügler Oct. 7, 2014, 9:13 p.m. UTC | #3
2014-10-07 23:11 GMT+02:00 François Dumont <frs.dumont@gmail.com>:
> On 06/10/2014 23:05, Daniel Krügler wrote:
>> François, could you please submit a corresponding LWG issue by sending
>> an email using the recipe described here:
>>
>> http://www.open-std.org/jtc1/sc22/wg21/docs/lwg-active.html#submit_issue
>>
>> ?
>>
> I just did requesting to use 2N log(N).
>
> And is it ok to commit those ?

Looks fine to me - Thanks!

- Daniel
Jonathan Wakely Oct. 8, 2014, 8:43 a.m. UTC | #4
On 06/10/14 23:00 +0200, François Dumont wrote:
>Good point, with n calls to pop_heap it means that limit must be 
>2*log(1) + 2*log(2) +... + 2*log(n) which is 2*log(n!) and  which is 
>also necessarily < 2*n*log(n). I guess Standard comittee has forgotten 
>the factor 2 in the limit so this is what I am using as limit in the 
>final test, unless someone prefer the stricter 2*log(n!) ?
>
>Ok to commit those new tests ?

Yes please - thanks.
Marc Glisse Oct. 18, 2014, 7:24 a.m. UTC | #5
On Mon, 6 Oct 2014, François Dumont wrote:

>    * testsuite/25_algorithms/push_heap/complexity.cc: New.

This test is randomly failing in about 1% to 2% of cases.
diff mbox

Patch

Index: testsuite/25_algorithms/make_heap/complexity.cc
===================================================================
--- testsuite/25_algorithms/make_heap/complexity.cc	(revision 0)
+++ testsuite/25_algorithms/make_heap/complexity.cc	(working copy)
@@ -0,0 +1,50 @@ 
+// Copyright (C) 2014 Free Software Foundation, Inc.
+//
+// This file is part of the GNU ISO C++ Library.  This library is free
+// software; you can redistribute it and/or modify it under the
+// terms of the GNU General Public License as published by the
+// Free Software Foundation; either version 3, or (at your option)
+// any later version.
+
+// This library is distributed in the hope that it will be useful,
+// but WITHOUT ANY WARRANTY; without even the implied warranty of
+// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+// GNU General Public License for more details.
+
+// You should have received a copy of the GNU General Public License along
+// with this library; see the file COPYING3.  If not see
+// <http://www.gnu.org/licenses/>.
+
+// { dg-options "-std=gnu++11" }
+
+#include <random>
+#include <vector>
+#include <algorithm>
+
+#include <testsuite_counter_type.h>
+#include <testsuite_hooks.h>
+
+void test01()
+{
+  using __gnu_test::counter_type;
+  const std::size_t nb_values = 1000;
+
+  std::random_device dev;
+  std::uniform_int_distribution<int> dist;
+  std::vector<counter_type> values;
+  values.reserve(nb_values);
+  for (std::size_t i = 0; i != nb_values; ++i)
+    values.push_back(dist(dev));
+
+  counter_type::reset();
+
+  std::make_heap(values.begin(), values.end());
+
+  VERIFY( counter_type::less_compare_count <= 3.0 * nb_values );
+}
+
+int main()
+{
+  test01();
+  return 0;
+}
Index: testsuite/25_algorithms/pop_heap/complexity.cc
===================================================================
--- testsuite/25_algorithms/pop_heap/complexity.cc	(revision 0)
+++ testsuite/25_algorithms/pop_heap/complexity.cc	(working copy)
@@ -0,0 +1,53 @@ 
+// Copyright (C) 2014 Free Software Foundation, Inc.
+//
+// This file is part of the GNU ISO C++ Library.  This library is free
+// software; you can redistribute it and/or modify it under the
+// terms of the GNU General Public License as published by the
+// Free Software Foundation; either version 3, or (at your option)
+// any later version.
+
+// This library is distributed in the hope that it will be useful,
+// but WITHOUT ANY WARRANTY; without even the implied warranty of
+// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+// GNU General Public License for more details.
+
+// You should have received a copy of the GNU General Public License along
+// with this library; see the file COPYING3.  If not see
+// <http://www.gnu.org/licenses/>.
+
+// { dg-options "-std=gnu++11" }
+
+#include <cmath>
+#include <random>
+#include <vector>
+#include <algorithm>
+
+#include <testsuite_counter_type.h>
+#include <testsuite_hooks.h>
+
+void test01()
+{
+  using __gnu_test::counter_type;
+  const std::size_t nb_values = 1000;
+
+  std::random_device dev;
+  std::uniform_int_distribution<int> dist;
+  std::vector<counter_type> values;
+  values.reserve(nb_values);
+  for (std::size_t i = 0; i != nb_values; ++i)
+    values.push_back(dist(dev));
+
+  std::make_heap(values.begin(), values.end());
+
+  counter_type::reset();
+
+  std::pop_heap(values.begin(), values.end());
+
+  VERIFY( counter_type::less_compare_count <= 2.0 * std::log(nb_values) );
+}
+
+int main()
+{
+  test01();
+  return 0;
+}
Index: testsuite/25_algorithms/push_heap/complexity.cc
===================================================================
--- testsuite/25_algorithms/push_heap/complexity.cc	(revision 0)
+++ testsuite/25_algorithms/push_heap/complexity.cc	(working copy)
@@ -0,0 +1,54 @@ 
+// Copyright (C) 2014 Free Software Foundation, Inc.
+//
+// This file is part of the GNU ISO C++ Library.  This library is free
+// software; you can redistribute it and/or modify it under the
+// terms of the GNU General Public License as published by the
+// Free Software Foundation; either version 3, or (at your option)
+// any later version.
+
+// This library is distributed in the hope that it will be useful,
+// but WITHOUT ANY WARRANTY; without even the implied warranty of
+// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+// GNU General Public License for more details.
+
+// You should have received a copy of the GNU General Public License along
+// with this library; see the file COPYING3.  If not see
+// <http://www.gnu.org/licenses/>.
+
+// { dg-options "-std=gnu++11" }
+
+#include <cmath>
+#include <random>
+#include <vector>
+#include <algorithm>
+
+#include <testsuite_counter_type.h>
+#include <testsuite_hooks.h>
+
+void test01()
+{
+  using __gnu_test::counter_type;
+  const std::size_t nb_values = 1000;
+
+  std::random_device dev;
+  std::uniform_int_distribution<int> dist;
+  std::vector<counter_type> values;
+  values.reserve(nb_values);
+  for (std::size_t i = 0; i != nb_values; ++i)
+    values.push_back(dist(dev));
+
+  std::make_heap(values.begin(), values.end());
+  values.push_back(dist(dev));
+
+  counter_type::reset();
+
+  std::push_heap(values.begin(), values.end());
+
+  VERIFY( counter_type::less_compare_count <= std::log(values.size()) );
+}
+
+int main()
+{
+  test01();
+  return 0;
+}
Index: testsuite/25_algorithms/sort_heap/complexity.cc
===================================================================
--- testsuite/25_algorithms/sort_heap/complexity.cc	(revision 0)
+++ testsuite/25_algorithms/sort_heap/complexity.cc	(working copy)
@@ -0,0 +1,53 @@ 
+// Copyright (C) 2014 Free Software Foundation, Inc.
+//
+// This file is part of the GNU ISO C++ Library.  This library is free
+// software; you can redistribute it and/or modify it under the
+// terms of the GNU General Public License as published by the
+// Free Software Foundation; either version 3, or (at your option)
+// any later version.
+
+// This library is distributed in the hope that it will be useful,
+// but WITHOUT ANY WARRANTY; without even the implied warranty of
+// MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE.  See the
+// GNU General Public License for more details.
+
+// You should have received a copy of the GNU General Public License along
+// with this library; see the file COPYING3.  If not see
+// <http://www.gnu.org/licenses/>.
+
+// { dg-options "-std=gnu++11" }
+
+#include <cmath>
+#include <random>
+#include <vector>
+#include <algorithm>
+
+#include <testsuite_counter_type.h>
+#include <testsuite_hooks.h>
+
+void test01()
+{
+  using __gnu_test::counter_type;
+  const std::size_t nb_values = 1000;
+
+  std::random_device dev;
+  std::uniform_int_distribution<int> dist;
+  std::vector<counter_type> values;
+  values.reserve(nb_values);
+  for (std::size_t i = 0; i != nb_values; ++i)
+    values.push_back(dist(dev));
+
+  std::make_heap(values.begin(), values.end());
+
+  counter_type::reset();
+
+  std::sort_heap(values.begin(), values.end());
+
+  VERIFY( counter_type::less_compare_count <= 2.0 * nb_values * std::log(nb_values) );
+}
+
+int main()
+{
+  test01();
+  return 0;
+}
Index: testsuite/util/testsuite_counter_type.h
===================================================================
--- testsuite/util/testsuite_counter_type.h	(revision 215958)
+++ testsuite/util/testsuite_counter_type.h	(working copy)
@@ -95,7 +95,10 @@ 
     { return val == rhs.val; }
 
     bool operator<(const counter_type& rhs) const
-    { return val < rhs.val; }
+    {
+      ++less_compare_count;
+      return val < rhs.val;
+    }
   };
 
   int counter_type::default_count = 0;