A short overview of what kernels to use in the current github code. It gets kind of confusing now.
Fermi scrypt: F Kernel
Fermi scrypt-jane: X Kernel
Kepler compute 3.0 scrypt: Y Kernel <--- better than 2013-12-18 release IMHO
Kepler compute 3.0 scrypt-jane: K Kernel
Kepler Compute 3.5 scrypt: Z Kernel <--- most likely better than 2013-12-18 release
Kepler Compute 3.5 scrypt-jane: T Kernel
NOTE: for best speed on Linux with Fermi you want to comment out the two rule lines for fermi_kernel.cu (possibly also for test_kernel.cu) in Makefile.am such that the default rule for compute_10 takes over. Then run autogen.sh and configure again. It seems bizarre, but compiling the code for sm_20 (the actual target hardware) only makes it slower. This puzzling effect was NOT observed on Windows. Comment it out like this:
#fermi_kernel.o: fermi_kernel.cu
# $(NVCC) @CFLAGS@ -Xptxas "-abi=no -v" -arch=sm_20 --maxrregcount=63 $(JANSSON_INCLUDES) -o $@ -c $<
#test_kernel.o: test_kernel.cu
# $(NVCC) @CFLAGS@ -Xptxas "-abi=no -v" -arch=sm_20 --maxrregcount=63 $(JANSSON_INCLUDES) -o $@ -c $<
Christian