Making Compiles Slow Through Abuse of Templates

When doing parallel builds it seems as if PDB generation takes a lot of time and effort, especially on non-SSD drives. I’m not sure if this is actually a bottleneck, but I hope that this slow program of yours generates a lot of PDB data so I can get a number on this gut feeling of mine from the next article! Otherwise it might be a good thing to tweak it in order to make it realistic!

Sorry to disappoint but my next article will strictly be covering compilation speed. However I’ll probably cover some aspects of linking and PDBs in a subsequent article.

Until then remember that incremental linking is faster than regular linking which is faster than LTCG linking. Choose wisely.

No worries, now I’ve got two posts to look forward to 🙂 Thanks for sharing!

Did you try with a constexpr function? I am curious to know whether a compiler is able to do the same caching, as it would be the modern way to write that kind of compile-time computation with a C++11-compliant compiler. It would at least have the bonus of not blowing up the number of type instantiations.

That sounds like a great test to run. You should totally do that, and let me know what you find. My guess is that it would naively have the exponential slowdown.

The result of the constexpr attempt on clang is exponential. I first had to raise the limits that prevent long constexpr evaluation: “-fconstexpr-depth=2147483647 -fconstexpr-steps=2147483647” 🙂

Pairs are value and time on my computer:
{32, 4s}, {33,6.4s}, {34,10.2s}, {35,16.4s}, {36,26.4s}, {37,42.7s}.

Each step takes a steady 1.6 times as long as the previous one. Time also scales linearly with the number of invocations, so there is no caching. Memory use is stable at 14MB.
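The 1.6 factor is what an uncached two-way recursion would predict, since the number of calls grows by roughly the golden ratio (about 1.618) per step. Here is a minimal sketch that counts those calls; `count_calls` is a hypothetical helper written for this illustration, not something from clang or the article:

```cpp
#include <cstdint>

// Hypothetical helper: the number of calls an uncached recursive fib(n)
// makes, counting the outermost call. Mirrors const_fib's base case (n <= 2).
constexpr std::uint64_t count_calls(int n)
{
    return n <= 2 ? 1 : 1 + count_calls(n - 1) + count_calls(n - 2);
}

// fib(10) is 55, and the call count works out to 2 * fib(n) - 1.
static_assert(count_calls(10) == 109, "call count for n = 10");

// Going from n = 20 to n = 21 multiplies the work by about 1.618.
static_assert(count_calls(21) * 1000 / count_calls(20) == 1618,
              "per-step growth is close to the golden ratio");
```

If the evaluator does no caching, time should track this call count, which is consistent with the measurements above.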

Looking at the code, it appears to evaluate the AST instead of compiling it. I’m not sure compiling it would even be possible while keeping track of the restrictions and behaviors of such functions anyway.

Thanks for the suggestion. The crazy number of types created by FibSlow_t had unintended side-effects but the constexpr technique (with the VS 2013 CTP) works like a charm. On my computer I get about 3.8s for 32, and memory consumption goes nuts with anything much larger. But no settings changes were needed.

BTW, code is:

constexpr int const_fib(int n)
{
    return n <= 2 ? 1 : const_fib(n - 1) + const_fib(n - 2);
}
constexpr int x = const_fib(30);

Get the VS 2013 CTP to compile it.

And somebody suggested using lambdas for slow compilation:

	// C++11 lambdas aren't terribly useful at producing art.
	// Source below is valid C++11. VS2013 takes about a minute at it (Release config),
	// reaches almost 4GB of memory usage and then gives a
	//
	// 1>ConsoleApplication1.cpp(34): fatal error C1001: An internal error has occurred in the compiler.
	// 1> (compiler file 'msc1.cpp', line 1325)
	// 1> To work around this problem, try simplifying or changing the program near the locations listed above.
	// 1> Please choose the Technical Support command on the Visual C++
	// 1> Help menu, or open the Technical Support help file for more information
	// 1> This error occurred in injected text:
	// 1> :
	// ========== Build: 0 succeeded, 1 failed, 0 up-to-date, 0 skipped ==========
	//
	// With more nested lambdas, it fails with a proper "compiler is out of heap space" error.

	int main()
	{
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	[](){
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	}();
	return 0;
	}

The caching technique is good old memoization: http://en.wikipedia.org/wiki/Memoization
Useful in many contexts, though obviously not for computing Fibs 🙂
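For readers who have not seen memoization in template form, here is a minimal sketch. The name `Fib` is made up for this example and is not the article's code. It relies on the fact that a compiler instantiates each distinct specialization only once, so every `Fib<N>::value` is computed a single time and then reused:

```cpp
// Hypothetical template, written for this illustration only.
// Each Fib<N> is instantiated once, so the instantiation itself
// acts as the memo table and the work is linear in N.
template <int N>
struct Fib
{
    static constexpr int value = Fib<N - 1>::value + Fib<N - 2>::value;
};

// Base cases, matching const_fib's "n <= 2 ? 1" convention.
template <>
struct Fib<1>
{
    static constexpr int value = 1;
};

template <>
struct Fib<2>
{
    static constexpr int value = 1;
};

// Only about 40 instantiations are needed here, not millions of calls.
static_assert(Fib<40>::value == 102334155, "fib(40)");
```

This is the behavior a slow-compile test has to defeat, since the cached version finishes almost instantly.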

Great read, and I’m looking forward to the next article, Bruce. Especially on the subject of link times, if you have a section on that (:

Very nice article! Waiting for the next one.

Cheers.

Usually what I do is just include boost::signals2 and I’m done :p
My hypothesis is that the slow compilation comes from either
a) preprocessor (front end?)
b) code generation (back end?)

As for a), in these times of SSDs and precompiled headers I don’t think (but I need to test) that this is an issue anymore. It’s interesting to use the Preprocess to file option in VS to see how large the preprocessed file is … but this is slower than usual.

For b), VS has an undocumented /Bt option (did I read this on your blog?) that you can pass to the compiler, and it shows how long the compile took but nothing more …

I usually see a correlation between the compile time and the resulting obj file size …
A nice tool to inspect obj files is this http://timtabor.com/dumpbinGUI/index.htm
Also some interesting stuff here http://gameangst.com/?tag=code-bloat

About linking (because this will surely be much requested), I remember this very old article
http://nedbatchelder.com/blog/200401/speeding_c_links.html

Like everybody … really Really REALLY … looking forward to reading the article …

The constexpr Fibonacci is better than boost::signals2 because it produces very small object files and PDBs, and it allows fine-grained control over how long you want each source file to take, which is perfect for my needs.
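As a rough sketch of what that per-file control could look like, the argument can come from a macro so each source file (or its compiler command line) picks its own cost. `FIB_COST` and `kCompileTimeBurn` are hypothetical names invented for this example, and the default of 20 is arbitrary:

```cpp
// Hypothetical knob: define FIB_COST per source file, or with -D / /D on
// the command line, to choose how much compile-time work this file does.
#ifndef FIB_COST
#define FIB_COST 20
#endif

// Same deliberately slow recursion as const_fib above.
constexpr int const_fib(int n)
{
    return n <= 2 ? 1 : const_fib(n - 1) + const_fib(n - 2);
}

// Forces the evaluation at compile time. It adds almost nothing to the
// object file because only a single int results.
constexpr int kCompileTimeBurn = const_fib(FIB_COST);
static_assert(kCompileTimeBurn > 0, "keeps the constant from looking unused");
```

Going by the timings reported earlier in the thread, each increment of the argument should cost about 1.6 times more, and large values may hit compiler evaluation limits.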

I forgot about /Bt. I should have mentioned that.

I think large amounts of memory are more important than SSDs for avoiding disk I/O costs — more on that later.

Thanks for the other links.

What a terribly inefficient method of calculating a Fibonacci series! I am sorry, but you have lost all credibility.

I hope you are joking, the horrible inefficiency is the whole point of the article! Part of knowing how to make compile times faster is knowing what makes them slower. If you are serious you need to re-read because you’ve missed the point!

If Fardon is not joking, I would bet he just read a few select words from the entire article. Which, BTW, costs him some credibility on literacy.

Another solution would be to change the function being computed to something inherently more costly, even with memoization, such as the Ackermann function.

For example: http://coliru.stacked-crooked.com/a/d8624892fb7ebf97
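For readers who do not want to follow the link, here is a minimal constexpr sketch of the two-argument Ackermann function. It is an independent illustration, not the code at the URL above, and the name `ackermann` is an assumption:

```cpp
// Two-argument Ackermann function written as a C++11 constexpr (single
// return statement). Illustrative only, not the code from the link above.
constexpr unsigned ackermann(unsigned m, unsigned n)
{
    return m == 0 ? n + 1
         : n == 0 ? ackermann(m - 1, 1)
                  : ackermann(m - 1, ackermann(m, n - 1));
}

// Small arguments are cheap: A(2, n) = 2n + 3 and A(3, n) = 2^(n + 3) - 3.
static_assert(ackermann(2, 3) == 9, "A(2, 3)");
static_assert(ackermann(3, 3) == 61, "A(3, 3)");
```

The cost rises very steeply with the first argument, so expect to hit the compiler's constexpr depth or step limits well before reaching the values computed for Fibonacci.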

18 Responses to Making Compiles Slow Through Abuse of Templates
