ChatGPT解决这个技术问题 Extra ChatGPT

Has anyone actually implemented a Fibonacci-Heap efficiently?

Has anyone of you ever implemented a Fibonacci-Heap? I did so a few years back, but it was several orders of magnitude slower than using array-based BinHeaps.

Back then, I thought of it as a valuable lesson in how research is not always as good as it claims to be. However, a lot of research papers claim the running times of their algorithms based on using a Fibonacci-Heap.

Did you ever manage to produce an efficient implementation? Or did you work with data-sets so large that the Fibonacci-Heap was more efficient? If so, some details would be appreciated.

Haven't you learned these algorithm dudes always hide their huge constants behind their large big oh?! :) It seems in practice, most of the time, that "n" thing never gets even close to the "n0"!
I know - now. I implemented it when I first got my copy of "Introduction to Algorithms". Also, I didn't pick Tarjan for someone who would invent a useless data-structure, because his Splay-Trees are actually quite cool.
mdm: Of course it's not useless, but just like insertion sort which beats quicksort in small data sets, binary heaps might work better due to smaller constants.
Actually, the program I needed the heap for was finding Steiner-Trees for routing in VLSI-chips, so the data sets were not exactly small. But nowadays (except for simple stuff like sorting) I would always use the simpler algorithm until it "breaks" on the data-set.
My answer to this is actually "yes". (Well, my coauthor on a paper did.) I don't have the code right now, so I'll get more info before I actually respond. Looking at our graphs, however, I note that F heaps make less comparisons than b heaps. Were you using something where comparison was cheap?

G
Guilherme Torres Castro

The Boost C++ libraries include an implementation of Fibonacci heaps in boost/pending/fibonacci_heap.hpp. This file has apparently been in pending/ for years and by my projections will never be accepted. Also, there have been bugs in that implementation, which were fixed by my acquaintance and all-around cool guy Aaron Windsor. Unfortunately, most of the versions of that file that I could find online (and the one in Ubuntu's libboost-dev package) still had the bugs; I had to pull a clean version from the Subversion repository.

Since version 1.49 Boost C++ libraries added a lot of new heaps structs including fibonacci heap.

I was able to compile dijkstra_heap_performance.cpp against a modified version of dijkstra_shortest_paths.hpp to compare Fibonacci heaps and binary heaps. (In the line typedef relaxed_heap<Vertex, IndirectCmp, IndexMap> MutableQueue, change relaxed to fibonacci.) I first forgot to compile with optimizations, in which case Fibonacci and binary heaps perform about the same, with Fibonacci heaps usually outperforming by an insignificant amount. After I compiled with very strong optimizations, binary heaps got an enormous boost. In my tests, Fibonacci heaps only outperformed binary heaps when the graph was incredibly large and dense, e.g.:

Generating graph...10000 vertices, 20000000 edges.
Running Dijkstra's with binary heap...1.46 seconds.
Running Dijkstra's with Fibonacci heap...1.31 seconds.
Speedup = 1.1145.

As far as I understand, this touches at the fundamental differences between Fibonacci heaps and binary heaps. The only real theoretical difference between the two data structures is that Fibonacci heaps support decrease-key in (amortized) constant time. On the other hand, binary heaps get a great deal of performance from their implementation as an array; using an explicit pointer structure means Fibonacci heaps suffer a huge performance hit.

Therefore, to benefit from Fibonacci heaps in practice, you have to use them in an application where decrease_keys are incredibly frequent. In terms of Dijkstra, this means that the underlying graph is dense. Some applications could be intrinsically decrease_key-intense; I wanted to try the Nagomochi-Ibaraki minimum-cut algorithm because apparently it generates lots of decrease_keys, but it was too much effort to get a timing comparison working.

Warning: I may have done something wrong. You may wish to try reproducing these results yourself.

Theoretical note: The improved performance of Fibonacci heaps on decrease_key is important for theoretical applications, such as Dijkstra's runtime. Fibonacci heaps also outperform binary heaps on insertion and merging (both amortized constant-time for Fibonacci heaps). Insertion is essentially irrelevant, because it doesn't affect Dijkstra's runtime, and it's fairly easy to modify binary heaps to also have insert in amortized constant time. Merge in constant time is fantastic, but not relevant to this application.

Personal note: A friend of mine and I once wrote a paper explaining a new priority queue, which attempted to replicate the (theoretical) running time of Fibonacci heaps without their complexity. The paper was never published, but my coauthor did implement binary heaps, Fibonacci heaps, and our own priority queue to compare the data structures. The graphs of the experimental results indicate that Fibonacci heaps slightly out-performed binary heaps in terms of total comparisons, suggesting that Fibonacci heaps would perform better in a situation where comparison cost exceeds overhead. Unfortunately, I do not have the code available, and presumably in your situation comparison is cheap, so these comments are relevant but not directly applicable.

Incidentally, I highly recommend trying to match the runtime of Fibonacci heaps with your own data structure. I found that I simply reinvented Fibonacci heaps myself. Before I thought that all of the complexities of Fibonacci heaps were some random ideas, but afterward I realized that they were all natural and fairly forced.


Thanks! This question had been sitting on my mind for a long time. I actually implemented Dijkstra's using Fibonacci-Heaps before I attempted Steiner-Trees. However, it seems that my graphs where much less dense than in your example. They had millions of nodes, but an average degree of only 5-6.
Fib Heap performance is predictable via the sequence of operations. I've written an Heap-heavy algorithm that ended up faster with the Fib Heap (vs. Bin Heap). The trick was batching the work. Regardless of the frequency of any operation, the difference lies here: DecreaseKey - ExtractMin - DecreaseKey - ExtractMin versus DecreaseKey - DecreaseKey - ExtractMin - ExtractMin (contd. below)
The latter will be roughly twice as fast, because the 2nd ExtractMin is nearly free. Our algorithm extracts a batch of Min elements of which many are discarded; a waste on a Bin Heap, but better on a Fib Heap. Sadly this isn't clearly reflected in the time complexity people provide when talking about Heap-based algorithms. With Amortized bounds, the total complexity isn't simply # operations * complexity of operation.
Any chance of also trying pairing heaps and/or relaxed heaps?
I'm not sure why your results appeared so close, I used STL priority_queue vs self-implemented fibonacci heap and the binary heap was behind by a large margin in my tests.
p
paperhorse

Knuth did a comparison between fibonacci heap and binary heaps for minimum spanning trees back in 1993 for his book Stanford Graphbase. He found fibonacci to be 30 to 60 precent slower than binary heaps at the graph sizes he was testing, 128 vertices at different densities.

The source code is in C (or rather CWEB which is a cross between C, math and TeX) in the section MILES_SPAN.


C
Community

Disclaimer

I know results are quite similar and "looks like running time is totally dominated by something other than the heap" (@Alpedar). But I couldn't find anything evidence of that in the code. The code it's open, so if you can find anything that can affect the result of the test, please tell me.

Maybe I did something wrong, but i wrote a test, based on A.Rex anwser comparing:

Fibonacci-Heap

D-Ary-heap (4)

Binary-Heap

Relaxed-Heap

The execution times (for complete graphs only) for all heaps were very close. The test were made for complete graphs with 1000,2000,3000,4000,5000,6000,7000 and 8000 vertices. For each test 50 random graphs were generated and the output is the average time of each heap:

Sorry about the output, it's not very verbose because i needed it to build some charts from text files.

Here are the results (in seconds):

https://i.stack.imgur.com/Gsd05.png


How much edges are there in each case? And what algorithm are you running exactly? Your results don't make sense if we don't know what we are dealing with.
As i sad , all graph are completes , so you can calculate the number of edges for each case. What you mean, "are you running exactly". They are in the head of the tables.
This looks like running time is totally dominated by something other than the heap (it could be generating of graph or some IO). Those almost exactly same results are unbelievable.
Well maybe the time os dominated by something else, but i am sure that isn't the IO or the generation of the graphs. Anyway the source code is available and i will be very glad if someone find an error and correct the mesure.
Those tests seem to be measuring something entirely different. Could you comment on the test you ran? Remember that the Shortest Path Problem on a complete graph is O(1) if distances are Geometric/Euclidian.
g
gabormakrai

I also did a small experiment with Fibonacci heap. Here is the link for the details: Experimenting-with-dijkstras-algorithm. I just googled the terms "Fibonacci heap java" and tried a few existing open-source implementation of the Fibonacci heap. It seems that some of them have some performance issue, but there are some which is quite good. At least, they are beating the naive and the binary heap PQ performance in my test. Maybe they can help to implement the efficient one.