2003-07-05 Steven G. Johnson * kernel/ifftw.h: typo * kernel/ifftw.h: removed typo 2003-07-05 Matteo Frigo * dft/ct-generic.c: Consistent naming * dft/Makefile.am, dft/conf.c, dft/ct-directw.c, dft/ct-directwbuf.c, dft/ct-generic.c, dft/ct.c, dft/ct.h, dft/ctsq.c, dft/dft.h, dft/dftw-dft.c, dft/direct.c, dft/directw.c, dft/directwbuf.c, dft/generic.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, dft/plan.c, dft/problemw.c, dft/rader.c, dft/solve.c: Got rid of problemw. 2003-07-04 Matteo Frigo * kernel/cycle.h, kernel/timer.c: Increase TIME_MIN on intel only * genfft/schedule.ml: A little hack to get more consistent scheduling. 2003-07-04 Steven G. Johnson * acinclude.m4, genfft/oracle.ml, tools/fftw-wisdom-to-conf.in: merged changes from mainline * configure.ac: bumped version * NEWS: updated for 3.0.1 2003-07-03 Matteo Frigo * genfft/magic.ml, genfft/schedule.ml: New experimental scheduler (currently disabled). The old scheduler is ``optimal'' in the sense that it minimizes register pressure. The only way to reduce register pressure is to schedule dependent instructions as closely as possible, so as to minimize the life time of registers. This strategy maximizes the number of pipeline stalls, however. With enough registers and short enough pipelines, this tradeoff is fine. This is no longer the case for the devilish pipeline of the Pentium IV or (probably) the PowerPC 970. The new scheduler switches to a ``list scheduler'' for dags smaller than a specified size. The list scheduler executes a butterfly left to right one column at the time. This amounts to the best possible pipeline utilization, and the worst possible register pressure. The ``specified size'' defaults to 0, i.e., no change from fftw2 and fftw-3.0. It seems like a value of 7--10 produces the best results for Pentium IV (probably screwing the G3/G4 powerpcs and sparc, but I haven't tried.) As time goes by, we may want to increase this number to favor newer processors over older processors. 2003-06-25 Steven G. Johnson * tools/fftw-wisdom-to-conf.in: remove non-portable use of tempfile; thanks to Nicolas Decoster for the patch * acinclude.m4: increase stupid HP preprocessor limits 2003-06-19 Matteo Frigo * genfft/Makefile.am: Distribute gen_mdct.ml 2003-06-16 Steven G. Johnson * dft/simd/codelets/Makefile.am: remove -trivial-stores, as in mainline 2003-06-11 Matteo Frigo * rdft/buffered2.c, rdft/rdft2-radix2.c: Cleared int/ptrdiff_t confusions * kernel/planner.c, dft/dftw-dft.c, dft/directwbuf.c, dft/rank0.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c: Cleared int/ptrdiff_t confusion 2003-06-08 Matteo Frigo * kernel/timer.c: Increased TIME_MIN. This seems to produce more reliable plans on Pentium IV. * dft/simd/codelets/Makefile.am: Removed relic -trivial-stores, which dates back to Franz's early experiments. Speed improved on SSE2, both with gcc and icc. 2003-06-06 Steven G. Johnson * doc/fftw3.texi: fix direntry 2003-06-05 Steven G. Johnson * genfft/gen_mdct.ml: added imdct 2003-06-04 Matteo Frigo * genfft/c.ml: Collect pattern (a * b) +- (c * d) in generic-arith, because this operation can usually be computed with one rounding in fixed-point (and it possibly exposes a FMA instruction) * genfft/c.ml, genfft/magic.ml: Generic-arithmetic unparser 2003-06-01 Matteo Frigo * genfft-k7/oracle.ml, genfft/oracle.ml: Oops---randomized CSE was using the same random numbers over and over * genfft/c.ml: Paranoia. * genfft/oracle.ml: Use relative error instead of absolute error, to avoid problems when normalization factors are used. 2003-06-01 Steven G. Johnson * reodft/reodft11e-radix2.c: slight opt * reodft/reodft11e-radix2.c: slight optimization * genfft/gen_mdct.ml: *W is const * genfft/gen_mdct.ml: comment 2003-05-30 Steven G. Johnson * genfft/Makefile.am, genfft/gen_mdct.ml: added experimental MDCT 2003-05-29 Steven G. Johnson * mkdist.sh: merge from main branch * mkdist.sh: altivec (fma) needs simd codlist.c too * mkdist.sh: make sure we include SIMD codlist.c for non-Unix folks * mkdist.sh: make sure we include simd codlist.c 2003-05-28 Steven G. Johnson * doc/fftw3.texi: noted howmany_rank == 0 is a single transform * doc/fftw3.texi: further stride clarification 2003-05-26 Matteo Frigo * dft/Makefile.am, dft/conf.c, dft/ct.c, dft/ctsq.c, dft/dft.h, dft/dftw-dft.c, dft/directw.c, dft/directwbuf.c, dft/directwsq.c, dft/kdft-difsq.c, dft/problemw.c: Removed transposed dftw problems. I now consider transposed dftw a Bad Idea, since it does not apply to the case that it was originally meant for (speed up four-step) and it complicates the implementation of the other thing I want to try (dftw m-slices). * dft/buffered.c: Obsolete comment 2003-05-24 Matteo Frigo * dft/ct.c: comment * dft/dftw-dft.c: Oops---wrong test NO_UGLYP instead of !NO_UGLYP * dft/ct.c: Implemented radix r, where n=r^2 * p 2003-05-21 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: xlc seems to properly use fma as well * configure.ac, doc/fftw3.texi: print warning if there is no cycle counter 2003-05-20 Steven G. Johnson * libbench2/verify-lib.c, libbench2/verify-r2r.c: updated Funda reference 2003-05-20 Matteo Frigo * dft/ct.c: const 2003-05-19 Matteo Frigo * dft/dftw-dft.c, dft/directwsq.c: Implemented generic dif square transposed (q-style) solver. * dft/dftw-dft.c: applicable() is now a property of the solver (in anticipation of transposed solvers) * dft/dftw-dft.c: Slight cleanup 2003-05-18 Matteo Frigo * dft/bluestein.c, kernel/ifftw.h, kernel/primes.c: Nothing, really * dft/dftw-dft.c: Moved vector loop inside bytwiddle(), in anticipation of a q-style dftw-dit transposed solver. * dft/dftw-dft.c: Fixed flops count * dft/dftw-dft.c: style * dft/dftw-dft.c: Faster inner loop. 2003-05-17 Matteo Frigo * dft/dftw-dft.c: Print vector length * dft/dftw-dft.c: Oops * dft/dftw-dft.c: Allow vl > 1 * dft/ctsq.c: Radix can be derived from problem---no need to pre-specify it. 2003-05-17 Steven G. Johnson * kernel/transpose.c: fixed comment * kernel/transpose.c: whoops, gcd is static * kernel/transpose.c: whoops, gcd should be static * kernel/transpose.c: more unrolling 2003-05-17 Matteo Frigo * dft/bluestein.c: Hack to avoid infinite recursion. 2003-05-16 Steven G. Johnson * dft/codelet-dft.h: consistency 2003-05-16 Matteo Frigo * dft/bluestein.c: Wrong comment. * dft/bluestein.c: Style. 2003-05-16 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: punctuation * doc/FAQ/fftw-faq.bfnn: added allzero FAQ * dft/bluestein.c: simplification: instead of cldb, just use cldf with inputs/output values swapped 2003-05-16 Matteo Frigo * dft/bluestein.c: Allow more general transform sizes. 2003-05-16 Steven G. Johnson * kernel/ifftw.h: MS has __int64, not long long (grr) * kernel/ifftw.h: slight change * kernel/ifftw.h: MS has __int64 type, not long long (grr) 2003-05-16 Matteo Frigo * dft/ct.c: Fixed printout * dft/bluestein.c: Fixed flop count * dft/Makefile.am, dft/bluestein.c, dft/conf.c, dft/dft.h: New bluestein solver * dft/ct.c: Implemented generic radix. * dft/generic.c, kernel/ifftw.h, kernel/twiddle.c: Removed conditional branch from inner loop in generic.c * dft/generic.c: Simplified indexing * dft/generic.c: Better still. * dft/generic.c: Further improvement of generic solver * dft/rader.c, dft/dftw-dft.c: Cleanup * dft/generic.c: Generic now only works for odd sized. Added check. * kernel/ifftw.h: Increased GENERIC_MIN_BAD because of new algorithm. * dft/generic.c: Much, much better. * dft/generic.c: Still trying to understand why rdft-generic-dit is faster then dft-generic... * dft/generic.c: Nothing, really * dft/generic.c: Never be clever for the sake of being clever. * dft/generic.c: Simplified. generic-dit is gone. The solver is now out-of-place only---buffering is done by the buffered solver. 2003-05-15 Matteo Frigo * dft/rader.c: rader-dit is gone. * dft/plan.c: Cast * configure.ac, dft/Makefile.am, dft/buffered.c, dft/codelet-dft.h, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/ctsq.c, dft/dft.h, dft/dftw-dft.c, dft/directw.c, dft/directwbuf.c, dft/directwsq.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, dft/plan.c, dft/problemw.c, dft/rader.c, dft/solve.c: Introduced twiddle problem ``dftw''. Changed most other things to deal with this change. 2003-05-15 Steven G. Johnson * kernel/primes.c: whoops, X(safe_mulmod) not fftw_safe_mulmod * simd/sse.c, simd/sse2.c: add VC++ versions of asm * simd/simd-sse2.h, simd/simd-sse.h: VC++ reportedly supports the intel intrinsics, but requires __inline instead of __inline__ * kernel/ifftw.h: precompute array indices with VC++ * acx_pthread.m4: added doc note 2003-05-14 Steven G. Johnson * threads/threads.c: autodetect windows * libbench2/getopt.c: don't bother with #ifdef HAVE_CONFIG_H, since non-Unix users always forget to define it 2003-05-13 Steven G. Johnson * kernel/cycle.h: VC++ uses __inline * doc/FAQ/fftw-faq.bfnn: added leak question 2003-05-12 Steven G. Johnson * kernel/cycle.h: LARGE_INTEGER needs windows.h (supposedly, there is some problem converting _itnt64 to double...damn MS and their nonstandard types) * libbench2/timer.c: whoops * tools/fftw-wisdom.c: added 256x256 to canonical list 2003-05-12 Matteo Frigo * kernel/transpose.c: Oops... 2003-05-11 Matteo Frigo * kernel/transpose.c: Unrolled loops, changed cutoff * tests/bench.c: Do not multiply strides by 2 twice. 2003-05-08 Steven G. Johnson * tests/Makefile.am: added 'make smallcheck' * configure.ac, doc/fftw3.texi, kernel/timer.c: --without-cycle-counter becomes --with-slow-timer, updated docs 2003-05-07 Steven G. Johnson * configure.ac: remove duplicate -openmp check; Sun requires -xopenmp * dft/ct-ditbuf.c, rdft/hc2hc-buf.c: fixed compilation under Sun C++ 2003-05-07 Matteo Frigo * kernel/planner.c, kernel/timer.c: Use estimator if cycle counter is unavailable, regardless of the FFTW_MEASURE/ESTIMATE setting. 2003-05-07 Steven G. Johnson * kernel/cycle.h: _WIN32 (not __WIN32__) is always defined * kernel/cycle.h: minor cleanup * kernel/cycle.h: tentative VC++ stuff, some consolidation 2003-05-06 Steven G. Johnson * kernel/cycle.h, kernel/timer.c: made cycle.h more self-contained 2003-05-06 Matteo Frigo * simd/simd-3dnow.h, simd/simd-sse.h, simd/simd-sse2.h: Use ``%'' flag to denote commutative operations. 2003-05-06 Steven G. Johnson * kernel/cycle.h: MIT license, brief documentation * doc/Makefile.am: whoops, forgot f77_wisdom.f 2003-05-04 Matteo Frigo * dft/problem.c, libbench/mp.c, libbench2/bench.h, libbench2/mp.c, libbench2/verify-lib.c, rdft/problem2.c: Improved speed of accuracy test. 2003-04-29 Matteo Frigo * kernel/cycle.h: s390 cycle counter 2003-04-26 Steven G. Johnson * doc/fftw3.texi: forgot r2r directory * rdft/Makefile.am, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c: delete unused files, since they don't compile any more 2003-04-24 Matteo Frigo * simd/simd-sse2.h: Better gcc code generation 2003-04-23 Steven G. Johnson * acinclude.m4: ccc is the Compaq C compiler on Linux/alpha * doc/fftw3.texi: whoops 2003-04-19 Matteo Frigo * kernel/cycle.h: ia64 cycle counter with intel compiler. 2003-04-18 Matteo Frigo * doc/FAQ/fftw-faq.bfnn: More gcc bugs. Sigh. * bootstrap.sh: touch ChangeLog to observe GNU standards * ChangeLog: We now build ChangeLog automatically at distribution time * mkdist.sh: Automatic ChangeLog hackery 2003-04-18 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: plural * NEWS: updated 2003-04-18 Matteo Frigo * ChangeLog: Updated 2003-04-18 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: a -> an * doc/FAQ/fftw-faq.bfnn: hyphen * doc/FAQ/fftw-faq.bfnn: comma * doc/FAQ/fftw-faq.bfnn: minor 2003-04-18 Matteo Frigo * doc/FAQ/fftw-faq.bfnn: Updated * mkdist.sh: New script that builds the distributions * dft/simd/codelets/Makefile.am: Oops again * dft/simd/codelets/Makefile.am: Oops, forgot -sign 1 * configure.ac, dft/simd/codelets/Makefile.am, dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/n2b.c, dft/simd/n2b.h, dft/simd/n2f.c, dft/simd/n2f.h: Reorganization of simd codelets * genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml: k7 assembly was not updated after conversion of opcnt from int to double 2003-04-17 Matteo Frigo * dft/vrank2-transpose.c, dft/vrank3-transpose.c: Capital `X' looks bad in all-lowercase plans * dft/codelets/standard/Makefile.am, dft/simd/codelets/Makefile.am, rdft/codelets/hc2r/Makefile.am, rdft/codelets/r2hc/Makefile.am: Removed redundant inline/noinline codelets * genfft/Makefile.am, genfft/gen_hc2hc.ml, genfft/gen_hc2r.ml, genfft/gen_hc2r_noinline.ml, genfft/gen_r2hc.ml, genfft/gen_r2hc_noinline.ml, genfft/gen_r2r.ml, kernel/ifftw.h, rdft/codelets/hc2r/Makefile.am, rdft/codelets/r2hc/Makefile.am, support/Makefile.codelets: New noinline Noinline real codelets 2003-04-17 Steven G. Johnson * TODO: more ideas 2003-04-17 Matteo Frigo * dft/simd/codelets/Makefile.am: Removed duplicate rules. * Makefile.am: acx_pthread.m4 was not distributed * support/Makefile.codelets: Oops * dft/codelets/standard/Makefile.am, genfft/Makefile.am, genfft/gen_notw.ml, genfft/gen_notw_noinline.ml, support/Makefile.codelets: Both inlined and non-inlined notw codelets. * dft/simd/codelets/Makefile.am, genfft/Makefile.am, genfft/gen_notw_noinline_c.ml, support/Makefile.codelets: Initial experiment with both inlined and non-inlined simd codelets. Both are included for now. * configure.ac, support/Makefile.codelets: --enable-fma to build FMA distribution 2003-04-16 Matteo Frigo * genfft/gen_notw_c.ml: Inline SIMD nontwiddle codelets * simd/simd-sse.h, simd/simd-sse2.h: Pathetic attempt at saving a couple of registers... * genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_notw_c.ml, genfft/gen_r2hc.ml, genfft/gen_r2r.ml, genfft/gen_twiddle.ml, genfft/gen_twiddle_c.ml: for (i = 0; i < m; ++i) ==> for (i = m; i > 0; --i) No proof of evidence that this is any faster, but just in case... 2003-04-15 Steven G. Johnson * dft/vrank-geq1.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: added hack to make sure that codelet loops are preferred to vecloop solvers in the estimator * api/f77funcs.h, api/fftw3.h, api/flops.c, doc/fftw3.texi, kernel/ifftw.h, tests/bench.c: use double for flops * kernel/cycle.h: metrowerks reportedly supports gcc assembly extensions on ppc 2003-04-14 Matteo Frigo * simd/Makefile.am, dft/simd/Makefile.am: foo_CFLAGS generates some automake junk that breaks the build on Redhat 7.3. Screw it. * tests/check.pl: Carefully check return status 2003-04-13 Matteo Frigo * genfft/c.ml, genfft/simd.ml, kernel/ifftw.h, support/Makefile.codelets: Removed annoying -FMA() expressions. 2003-04-12 Matteo Frigo * kernel/ifftw.h: Major fma hackery * api/apiplan.c: Slight cleanup * configure.ac: Updated version number * acinclude.m4: Damn autoconf * acinclude.m4: Recognize all 74xx processors * acinclude.m4: Detect 7400 processor. * acinclude.m4: No need to check for gcc-2.95 2003-04-11 Steven G. Johnson * NEWS: removed duplicate 2003-04-11 Matteo Frigo * libbench2/report.c: mflops ==> ``mflops'' * libbench2/report.c: Print setup time as well 2003-04-10 Matteo Frigo * simd/taint.c, rdft/problem.c, rdft/problem2.c, kernel/ifftw.h, dft/problem.c: Enforce pointer equality for in-place problems. 2003-04-09 Steven G. Johnson * ChangeLog, NEWS: updated * tests/README: cross-ref fftw-wisdom man page 2003-04-09 Matteo Frigo * kernel/planner.c: Undone previous change, committed by mistake. * tests/Makefile.am, tests/README, kernel/planner.c: Quick and dirty README for bench * libbench2/bench-main.c, libbench2/timer.c: Consider additional command-line arguments as problems to be benchmarked. * libbench2/bench-main.c, libbench2/bench.h, libbench2/report.c: Default report format is now human-readable. Removed unnecessary complexity in benchmark reporting. * doc/fftw3.texi: Updated for new interleaved/split api. 2003-04-09 Steven G. Johnson * doc/fftw3.texi: updated citation 2003-04-08 Matteo Frigo * configure.ac: Time for beta3 2003-04-08 Steven G. Johnson * reodft/redft00e-r2hc-pad.c: whoops, added * doc/fftw3.texi: more comparison of different R*DFT types * reodft/redft00e-r2hc.c, reodft/rodft00e-r2hc.c: comments * reodft/Makefile.am, reodft/conf.c, reodft/reodft.h, reodft/rodft00e-r2hc-pad.c: more accurate DCT-I and DST-I, at the expense of up to a factor of 2 in speed and memory 2003-04-08 Matteo Frigo * kernel/planner.c: Workaround gcc/sparc bug 2003-04-08 Steven G. Johnson * doc/fftw3.texi: rumors 2003-04-07 Steven G. Johnson * tests/hook.c: added rdft2 paranoid mode * tests/hook.c: added paranoid mode for r2r * libbench2/verify-r2r.c: whoops, sincos is predefined on some systems 2003-04-05 Matteo Frigo * tests/hook.c: bp->destroy_input was not initialized * dft/problem.c, kernel/ifftw.h, rdft/problem.c, rdft/problem2.c: Asserted correctness conditions for tainted pointers. (For now, use CK() while we test. They should be changed into A() at some point.) * dft/problem.c, rdft/problem.c, rdft/problem2.c: Untaint pointers before zero'ing arrays and before hashing * libbench2/bench-main.c: Alignment check did not work with icc, which seems to be confused by the fact that the variable is not used. * tests/Makefile.am: More paranoid paranoid-check * kernel/ifftw.h: 0 == x & 7 parses as (0 == x) & 7, which is wrong 2003-04-05 Steven G. Johnson * dft/direct.c, kernel/ifftw.h, kernel/planner.c, libbench2/bench-main.c, rdft/direct.c, rdft/direct2.c: alignment checks * rdft/rdft-dht.c: prevent infinite loops in exhaustive planning * api/Makefile.am, api/api.h, api/apiplan.c, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-split-dft-c2r.c, api/execute-split-dft-r2c.c, api/execute-split-dft.c, api/f77funcs.h, api/fftw3.h, api/mktensor-iodims.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-guru-split-dft-c2r.c, api/plan-guru-split-dft-r2c.c, api/plan-guru-split-dft.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, tests/bench.c: split/unsplit guru interface 2003-04-05 Matteo Frigo * tests/hook.c: Need UNTAINT in verifier too. * simd/taint.c: Forgot #if HAVE_SIMD * api/fftw3.h, kernel/align.c, kernel/ifftw.h, simd/Makefile.am, simd/simd.h, simd/taint.c: Keep track of two separate taint bits 2003-04-05 Steven G. Johnson * api/api.h, api/fftw3.h, api/mapflags.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, dft/k7/k7.c, dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/n2b.c, dft/simd/n2f.c, dft/simd/q1b.c, dft/simd/q1f.c, dft/simd/t1b.c, dft/simd/t1f.c, kernel/ifftw.h, tests/bench.c: added NO_SIMD problem flag, made UNALIGNED an API issue (taints input pointers) 2003-04-04 Steven G. Johnson * dft/buffered.c, rdft/buffered.c, rdft/buffered2.c: bugfix in buffered: wrong pointers passed for cldrest; also use TAINT instead of UNALIGNED in buffered2 2003-04-04 Matteo Frigo * dft/vrank-geq1.c: Reverted previous change, committed accidentally * kernel/align.c: What was I thinking? * dft/vrank-geq1.c: *** empty log message *** 2003-04-04 Steven G. Johnson * configure.ac, libbench2/aligned-main.c: added --enable-debug-alignment * kernel/align.c, kernel/ifftw.h: X(taint) prototype, define corresponding function only if HAVE_SIMD 2003-04-04 Matteo Frigo * dft/buffered.c, dft/solve.c, dft/vrank-geq1.c, kernel/align.c, kernel/ifftw.h, rdft/buffered.c, rdft/buffered2.c, rdft/solve.c, rdft/solve2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: Initial checkin of tained pointers * rdft/buffered.c, rdft/buffered2.c, rdft/dht-rader.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, kernel/align.c, kernel/ifftw.h, dft/simd/n2b.c, dft/simd/n2f.c, dft/buffered.c, dft/rader.c, dft/vrank-geq1.c: More conservative preservation of alignment 2003-04-04 Steven G. Johnson * api/apiplan.c, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-r2r.c, api/execute.c, api/f77funcs.h: plan/execute with aligned stack 2003-04-03 Steven G. Johnson * api/Makefile.am: whoops, missed FFTW_MEASURE in fftw3.f * threads/threads.c: use WITH_ALIGNED_STACK for experimental semaphore stuff, too 2003-04-03 Matteo Frigo * kernel/stack.c: Removed old file * kernel/Makefile.am, kernel/ifftw.h, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/threads.h, threads/vrank-geq1-rdft2.c: Improved stack-alignment hack 2003-04-03 Steven G. Johnson * threads/threads.c: use aligned stack for experimental semaphores, too * kernel/ifftw.h, kernel/stack.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/threads.c, threads/threads.h, threads/vrank-geq1-rdft2.c: whoops * kernel/ifftw.h, kernel/stack.c, threads/ct-dit.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/threads.c: fix(?) for SIMD thread problems * doc/fftw3.texi: noted n=1 REDFT01 case * doc/fftw3.texi: note about n=2 REDFT00 formula * doc/fftw3.texi: note about undefined REDFT00 * doc/fftw3.texi: noted n=1 RODFT01 case * doc/equation-redft11.png, doc/equation-rodft01.png, doc/equation-rodft11.png, doc/fftw3.texi: corrected definitions * rdft/codelet-rdft.h, rdft/problem.c, rdft/vrank-geq1.c: added REODFT_KINDP, fixed nontrivial test for R2HC11 and HC2R11 (not that we support these yet anyway) * rdft/codelets/hc2r/Makefile.am, rdft/problem.c: size 2 hc2r and dht are equivalent to r2hc 2003-04-02 Steven G. Johnson * doc/fftw3.texi: noted overwriting in upgrading section 2003-04-02 Matteo Frigo * kernel/Makefile.am, kernel/align.c, kernel/stack.c: Moved with_aligned_stack to its own file * kernel/align.c, libbench2/aligned-main.c: Fixed comments * kernel/align.c, kernel/ifftw.h, libbench2/aligned-main.c, libbench2/bench-main.c: Alignment hacks 2003-04-01 Steven G. Johnson * threads/threads.c: phew, no, previous version was okay * threads/threads.c: whoops, crap 2003-04-01 Matteo Frigo * simd/simd-sse2.h: support sse2 in forthcoming gcc-3.3 2003-04-01 Steven G. Johnson * kernel/cycle.h: comment * kernel/cycle.h: noted ac_check_headers * kernel/cycle.h: comment * kernel/cycle.h: documented autoconf tests, so that cycle.h can be distributed separately * NEWS: IRIX is all-caps * NEWS: noted Irix fix * threads/api.c, threads/threads.h: whoops * threads/threads.c: use ithreads_init so as not to confuse fftw 2 users * threads/threads.c: IRIX lossage * configure.ac: check for -openmp (icc) among the OpenMP flags (TODO: make this a separate macro, with a loop instead of repeated checks) 2003-03-31 Steven G. Johnson * doc/fftw3.texi: clarification 2003-03-31 Matteo Frigo * acinclude.m4: More liberal test for solaris CC * simd/simd-sse.h, simd/simd-sse2.h: Allow x86-64 simd * kernel/cycle.h: Added x86-64 timer code 2003-03-31 Steven G. Johnson * NEWS, ChangeLog: updated * doc/FAQ/fftw-faq.bfnn: colon 2003-03-31 Matteo Frigo * doc/FAQ/fftw-faq.bfnn: Reorganized compiler bugs section (which is growing out of control) * doc/FAQ/fftw-faq.bfnn: solaris gcc bug appears to be also in 2.95.2 * kernel/planner.c: Workaround works---there is another gcc/sparc bug elsehwere * kernel/planner.c: Grrr, workaround does not work. * kernel/planner.c: ADDMOD is now function, which seems to avoid gcc bugs. 2003-03-30 Matteo Frigo * kernel/planner.c: Workaround sparc gcc bug 2003-03-30 Steven G. Johnson * doc/fftw3.texi: note * dft/vrank2-transpose.c, dft/vrank3-transpose.c: make non-square UGLY, for now * tests/bench.c: added -o amnesia to forget_wisdom before each plan 2003-03-30 Matteo Frigo * libbench2/bench-user.h, libbench2/report.c, libbench2/speed.c: Report setup time in benchmark 2003-03-30 Steven G. Johnson * kernel/transpose.c: comment * doc/fftw3.texi: slight change 2003-03-29 Matteo Frigo * kernel/ct.c: More relaxed definition of UGLYness 2003-03-29 Steven G. Johnson * rdft/codelet-rdft.h, rdft/hc2hc.h, rdft/rdft.h, reodft/reodft.h, threads/threads.h: no more cvs id strings in header files...I'm tired of having to rebuild everything after a commit * rdft/Makefile.am, rdft/buffered2.c, rdft/direct2.c, rdft/rdft.h, rdft/rdft2-inplace-strides.c, rdft/rdft2-strides.c, rdft/rdft2-tensor-max-index.c, rdft/vrank-geq1-rdft2.c, threads/vrank-geq1-rdft2.c: rdft2 stride unification * rdft/vrank-geq1-rdft2.c: preserve in-place-ness * tests/Makefile.am, tests/bench.c, tests/check.pl: make nowisdom the default 2003-03-29 Matteo Frigo * tests/Makefile.am: --verbose in paranoid-check produces too much output. Make it quiet. 2003-03-29 Steven G. Johnson * dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/transpose.c: fixed transpose bugs...need to check ri-ii before deciding whether Ntuple fits 2003-03-29 Matteo Frigo * tests/check.pl: try more 2^k * kernel/ifftw.h: MIN_ALIGNMENT was defined after being used, causing crash in sse2. 2003-03-29 Steven G. Johnson * kernel/Makefile.am, kernel/ifftw.h, kernel/tensor10.c, kernel/transpose.c, rdft/Makefile.am, rdft/conf.c: real transposes are currently unused, and are not needed for MPI code either * dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/Makefile.am, kernel/ifftw.h, kernel/transpose.c: added general transpose * libbench2/problem.c: added transposition option * dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/Makefile.am, kernel/ifftw.h, kernel/tensor10.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c: yikes, fixed incorrect applicability of transpose plans * rdft/dft-r2hc.c: in the future, we might want to allow sz->rnk == 0, vecsz->rnk arbitrary to be converted to r2hc (the apply function already should work for this case)...disabled for now, though * kernel/align.c, kernel/ifftw.h, rdft/vrank-geq1-rdft2.c: use most_unaligned in rdft2 * tests/Makefile.am: slight change * tests/Makefile.am: output message when checks pass 2003-03-28 Steven G. Johnson * kernel/ifftw.h: added ifndef alloca around alloca stuff 2003-03-28 Matteo Frigo * rdft/dht-rader.c, kernel/ifftw.h, kernel/align.c, dft/rader.c, dft/vrank-geq1.c: Proper alignment in rader 2003-03-28 Steven G. Johnson * kernel/ifftw.h: whitespace * kernel/ifftw.h: whoops, alloca stuff inside HAVE_ALLOCA * tests/Makefile.am: make check can afford to be a little bigger * kernel/ifftw.h: use same alloca macrology as configure script * kernel/ifftw.h: fallback is no longer needed for mingw * kernel/ifftw.h: alloca fallback for gcc * configure.ac: _alloca was added for MinGW, but it causes problems there * kernel/align.c: fixed most_unaligned for split format * Makefile.am: whoops * Makefile.am, configure.ac, fftw.pc.in: added pkg-config 2003-03-27 Steven G. Johnson * dft/vrank-geq1.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: fixed asserts 2003-03-27 Matteo Frigo * kernel/align.c: Do not adjust r/i pointers separately. * dft/simd/n2b.h, dft/simd/n2f.h: iForgot to add files * dft/simd/codelets/Makefile.am, dft/simd/Makefile.am, dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/n2b.c, dft/simd/n2f.c, configure.ac: Specialized n simd codelets for unit vector stride. * configure.ac: Changed version number to beta2 * api/mapflags.c, dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/q1b.c, dft/simd/q1f.c, dft/simd/t1b.c, dft/simd/t1f.c, dft/vrank-geq1.c, kernel/align.c, kernel/ifftw.h, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: Changed alignment requirements for n1 simd codelets. Changed mechanism for detecting lack of alignment. * tests/bench.c: Oops, wrong place for hook 2003-03-27 Steven G. Johnson * dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/k7/codelets/Makefile.am, dft/simd/codelets/Makefile.am, rdft/codelets/hc2r/Makefile.am, rdft/codelets/r2hc/Makefile.am, rdft/codelets/r2r/Makefile.am: added comments to codelet makefiles, to aid people wanting to generate their own code * doc/FAQ/fftw-faq.bfnn: Matteo is also a copyright holder * doc/FAQ/fftw-faq.bfnn: FORTRAN is officially Fortran, these days * doc/FAQ/fftw-faq.bfnn: punctuation * doc/FAQ/fftw-faq.bfnn: don't use "wrapper" * doc/FAQ/fftw-faq.bfnn: plural * doc/FAQ/fftw-faq.bfnn: grammar * doc/FAQ/fftw-faq.bfnn: better phrasing * kernel/align.c: stddef.h should not be needed anymore for this file * dft/codelets/standard/Makefile.am: added comments for Franz mode * dft/simd/codelets/Makefile.am: clarification * dft/simd/codelets/Makefile.am: commented on FRANZ codelets * NEWS: updated * dft/codelets/inplace/Makefile.am: disable DIF codelets, since they are never used (apparently) except for some non-power-of-two sizes...improve support for the latter by adding size 3, 5, and 6 q^2 codelets. * doc/fftw3.texi: DHT has no forward/backward 2003-03-27 FFTW Maintainers * tests/bench.c: added hacky way to use an arbitrary flag 2003-03-27 Matteo Frigo * tests/bench.c: Better place to install hook 2003-03-27 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: noted that the user should run make check if they think FFTW has a bug 2003-03-26 Matteo Frigo * kernel/planner.c: Oops, what am I thinking * kernel/planner.c: Grrr.... fixed bug in estimator * genfft/c.ml: Oops---the flop count was right. The estimator is broken elsewhere. * genfft/c.ml: Fixed SIMD estimator * dft/simd/Makefile.am, dft/simd/codelets/Makefile.am, dft/simd/q1b.c, dft/simd/q1b.h, dft/simd/q1f.c, dft/simd/q1f.h, dft/simd/t1b.c, dft/simd/t1f.c, genfft/Makefile.am, genfft/gen_twiddle_c.ml, genfft/gen_twidsq_c.ml, support/Makefile.codelets: Added twidsq simd codelets 2003-03-26 Steven G. Johnson * doc/fftw3.texi: gensrc -> genfft * TODO: newline 2003-03-26 Matteo Frigo * TODO: Noted need to add dif simd codelets 2003-03-25 Steven G. Johnson * doc/fftw3.texi: noted shift * doc/fftw3.texi: clarification * doc/fftw3.texi: need make after bootstrap * doc/fftw3.texi: slight change * doc/fftw3.texi: libtool is also needed * doc/fftw3.texi: added code generator introduction * Makefile.am, configure.ac, genfft/Makefile.am, genfft/complex.ml, genfft/complex.mli, genfft/gen_r2r.ml, genfft/gen_trig.ml, genfft/trig.ml, rdft/Makefile.am, rdft/codelet-rdft.h, rdft/codelets/Makefile.am, rdft/codelets/r2r.c, rdft/codelets/r2r.h, rdft/codelets/r2r/Makefile.am, rdft/conf.c, rdft/direct.c, rdft/kr2r.c, rdft/rdft.h, support/Makefile.codelets: added support for REDFT/RODFT/DHT direct codelets * doc/FAQ/fftw-faq.bfnn: noted ARM bug; thanks to Jay Treacy 2003-03-25 Matteo Frigo * genfft-k7/vK7Optimization.ml: bugfix from Stefan 2003-03-24 Steven G. Johnson * doc/fftw3.texi: slight change * doc/fftw3.texi: caveat * doc/fftw3.texi: warning about DHT 2003-03-24 Matteo Frigo * dft/k7/codelets/Makefile.am: Oops * dft/k7/codelets/Makefile.am, tests/Makefile.am, tests/check.pl: Regression test for p4fftwgel 2003-03-24 Steven G. Johnson * tests/Makefile.am: make check is faster, old tests are in make bigcheck 2003-03-22 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: note * doc/FAQ/fftw-faq.bfnn: whoops, line wrapping 2003-03-21 Matteo Frigo * dft/codelets/standard/Makefile.am, genfft/gen_notw.ml, genfft/gen_twiddle.ml: Franz-mode codelets even without SIMD. (disabled) * doc/FAQ/fftw-faq.bfnn: Bug is in netbsd-1.6, not 1.5 * simd/simd-altivec.h: const cast, should placate c++ compilers. 2003-03-20 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: added FAQ on why plans are array-specific * reodft/reodft010e-r2hc.c: comment fix * reodft/reodft010e-r2hc.c: noted comparison to NR * api/fftw3.h: whoops, C99 complex didn't work if complex is a macro (as it is with glibc); thanks to Keh-Cheng Chu for the bug report 2003-03-19 Steven G. Johnson * configure.ac: noted in help that --enable-k7 enables 3dnow, and that --enable-3dnow is only a fallback 2003-03-19 Matteo Frigo * doc/FAQ/Makefile.am, doc/FAQ/fftw-faq.bfnn, doc/FAQ/html.refs: New gcc bug. html.refs was not in repository/distribution. * tests/bench.c: Don't write wisdom if you don't have it. 2003-03-18 Matteo Frigo * doc/fftw3.texi: Added index entries for DHT. Similarly for DCT, DST 2003-03-18 Steven G. Johnson * api/f77api.c, api/f77funcs.h: execute should not go through C api, for efficiency 2003-03-18 Matteo Frigo * api/fftw3.h: Renamed FFTW_IODIM, FFTW_R2R_KIND 2003-03-18 Steven G. Johnson * doc/Makefile.am: added rfftwnd.eps to dist, so that transfig is not required for people trying to build other formats (e.g. ps); thanks to Brian Gough for the bug report 2003-03-17 Steven G. Johnson * doc/fftw3.texi: pointer to upgrading section from tutorial * api/f77funcs.h, api/fftw3.h, api/print-plan.c, doc/fftw3.texi, tests/bench.c: make print_plan and fprint_plan, so that the former can be more easily called from other languages * doc/fftw3.texi: whoops, forgot to change equation image links to .png 2003-03-17 Matteo Frigo * api/fftw3.h, api/version.c, support/Makefile.codelets: fixed c++ linkage problems * api/fftw3.h, api/version.c: Removed ``const'', otherwise c++ link fails 2003-03-17 Steven G. Johnson * api/f77api.c, api/f77funcs.h, api/version.c, libbench2/allocate.c, libbench2/getopt-utils.c, libbench2/problem.c, libbench2/speed.c, libbench2/timer.c, libbench2/verify-r2r.c, libbench2/zero.c, support/Makefile.codelets, tests/bench.c, tests/hook.c, tools/fftw-wisdom.c: fixed C++ annoyances: void* casts, and global variables are static by default(?!?) 2003-03-16 Steven G. Johnson * doc/FAQ/fftw-faq.bfnn: ranlib bug is in binutils * doc/FAQ/fftw-faq.bfnn: ranlib Irix bug * tests/check.pl: start with random tests * kernel/ifftw.h, libbench2/verify-r2r.c, rdft/direct.c, rdft/direct2.c, threads/Makefile.am, api/Makefile.am, dft/direct.c: silenced some compiler warnings, eliminated unused variables, and fixed Makefile.am for f77funcs.h * doc/FAQ/fftw-faq.bfnn: whoops * doc/fftw3.texi: 3dnow is float * doc/fftw3.texi: fixed k7 docs * kernel/cycle.h: SGI compilers now support inline * kernel/cycle.h: cruft * doc/fftw3.texi: texinfo doesn't like commas in nodes * README, ChangeLog: updated * api/f77api.c, api/f77funcs.c, api/f77funcs.h, threads/f77api.c, threads/f77funcs.c, threads/f77funcs.h: f77funcs.c -> f77funcs.h so that people don't try to compile it * doc/FAQ/fftw-faq.bfnn: minor changes * doc/FAQ/fftw-faq.bfnn: updated compiler bug list * doc/fftw3.texi: noted how to set CC * TODO: TODONE * threads/vrank-geq1-rdft2.c: yikes, bugfix * kernel/ifftw.h: whoops 2003-03-16 Matteo Frigo * api/version.c: Report SIMD extensions in version string 2003-03-15 Steven G. Johnson * tests/bench.c: more verbose output * doc/fftw3.texi: a couple of additional non-Unix instructions * doc/FAQ/fftw-faq.bfnn: hyphen * doc/FAQ/fftw-faq.bfnn: softened * configure.ac, doc/FAQ/Makefile.am, doc/FAQ/bfnnconv.pl, doc/FAQ/fftw-faq.bfnn, doc/FAQ/m-ascii.pl, doc/FAQ/m-html.pl, doc/FAQ/m-info.pl, doc/FAQ/m-lout.pl, doc/FAQ/m-post.pl, doc/Makefile.am, doc/equation-dft.gif, doc/equation-dft.png, doc/equation-dht.gif, doc/equation-dht.png, doc/equation-idft.gif, doc/equation-idft.png, doc/equation-redft00.gif, doc/equation-redft00.png, doc/equation-redft01.gif, doc/equation-redft01.png, doc/equation-redft10.gif, doc/equation-redft10.png, doc/equation-redft11.gif, doc/equation-redft11.png, doc/equation-rodft00.gif, doc/equation-rodft00.png, doc/equation-rodft01.gif, doc/equation-rodft01.png, doc/equation-rodft10.gif, doc/equation-rodft10.png, doc/equation-rodft11.gif, doc/equation-rodft11.png: added FAQ, used PNGs * genfft-k7/assoctable.ml, genfft-k7/assoctable.mli, genfft-k7/expr.ml, genfft-k7/expr.mli, genfft-k7/fft.ml, genfft-k7/littlesimp.ml, genfft-k7/littlesimp.mli, genfft-k7/monads.ml, genfft-k7/number.ml, genfft-k7/number.mli, genfft-k7/oracle.ml, genfft-k7/oracle.mli, genfft-k7/to_alist.ml, genfft-k7/to_alist.mli, genfft-k7/twiddle.ml, genfft-k7/twiddle.mli, kernel/align.c, kernel/alloc.c, kernel/assert.c, kernel/awake.c, kernel/buffered.c, kernel/ct.c, kernel/cycle.h, kernel/debug.c, kernel/hash.c, kernel/iabs.c, kernel/ifftw.h, kernel/md5-1.c, kernel/md5.c, kernel/minmax.c, kernel/ops.c, kernel/pickdim.c, kernel/plan.c, kernel/planner.c, kernel/primes.c, kernel/print.c, kernel/problem.c, kernel/rader.c, kernel/scan.c, kernel/solver.c, kernel/solvtab.c, kernel/square.c, kernel/stride.c, kernel/tensor.c, kernel/tensor1.c, kernel/tensor2.c, kernel/tensor4.c, kernel/tensor5.c, kernel/tensor7.c, kernel/tensor8.c, kernel/tensor9.c, kernel/timer.c, kernel/trig.c, kernel/trig1.c, kernel/twiddle.c, libbench/bench-main.c, libbench/bench-user.h, libbench/bench.h, libbench/can-do.c, libbench/getopt-utils.c, libbench/info.c, libbench/main.c, libbench/prime.c, libbench/problem.c, libbench/report.c, libbench/speed.c, libbench/timer.c, libbench/util.c, libbench/verify.c, libbench/zero.c, libbench2/aligned-main.c, libbench2/bench-main.c, libbench2/bench-user.h, libbench2/bench.h, libbench2/can-do.c, libbench2/dotens2.c, libbench2/getopt-utils.c, libbench2/info.c, libbench2/main.c, libbench2/problem.c, libbench2/report.c, libbench2/speed.c, libbench2/tensor.c, libbench2/timer.c, libbench2/useropt.c, libbench2/util.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-r2r.c, libbench2/verify-rdft2.c, libbench2/verify.c, libbench2/verify.h, libbench2/zero.c, rdft/buffered.c, rdft/buffered2.c, rdft/codelet-rdft.h, rdft/codelets/hb.h, rdft/codelets/hc2r.c, rdft/codelets/hc2r.h, rdft/codelets/hc2rIII.h, rdft/codelets/hf.h, rdft/codelets/hfb.c, rdft/codelets/r2hc.c, rdft/codelets/r2hc.h, rdft/codelets/r2hcII.h, rdft/conf.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/hc2hc.h, rdft/indirect.c, rdft/khc2hc-dif.c, rdft/khc2hc-dit.c, rdft/khc2r.c, rdft/kr2hc.c, rdft/nop.c, rdft/nop2.c, rdft/plan.c, rdft/plan2.c, rdft/problem.c, rdft/problem2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0-rdft2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft.h, rdft/rdft2-inplace-strides.c, rdft/rdft2-radix2.c, rdft/rdft2-tensor-max-index.c, rdft/solve.c, rdft/solve2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/conf.c, reodft/redft00e-r2hc.c, reodft/reodft.h, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc-odd.c, reodft/reodft11e-r2hc.c, reodft/reodft11e-radix2.c, reodft/rodft00e-r2hc.c, simd/3dnow.c, simd/altivec.c, simd/simd-3dnow.h, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h, simd/simd.h, simd/sse-aux.c, simd/sse.c, simd/sse2-aux.c, simd/sse2.c, threads/api.c, threads/conf.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/f77api.c, threads/f77funcs.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/threads.c, threads/threads.h, threads/vrank-geq1-rdft2.c, tools/fftw-wisdom-to-conf.1, tools/fftw-wisdom-to-conf.in, tools/fftw-wisdom.c, tools/fftw_wisdom.1.in, COPYRIGHT, TODO, api/api.h, api/apiplan.c, api/configure.c, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-r2r.c, api/execute.c, api/export-wisdom-to-file.c, api/export-wisdom-to-string.c, api/export-wisdom.c, api/extract-reim.c, api/f77api.c, api/f77funcs.c, api/fftw3.h, api/flops.c, api/forget-wisdom.c, api/import-system-wisdom.c, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, api/import-wisdom.c, api/map-r2r-kind.c, api/mapflags.c, api/mkprinter-file.c, api/mktensor-iodims.c, api/mktensor-rowmajor.c, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c, api/print-plan.c, api/rdft2-pad.c, api/the-planner.c, api/version.c, api/x77.h, dft/buffered.c, dft/codelet-dft.h, dft/codelets/n.c, dft/codelets/n.h, dft/codelets/t.c, dft/codelets/t.h, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/direct.c, dft/generic.c, dft/indirect.c, dft/k7/k7.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, dft/kdft.c, dft/nop.c, dft/plan.c, dft/problem.c, dft/rader-omega.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/t1b.c, dft/simd/t1b.h, dft/simd/t1f.c, dft/simd/t1f.h, dft/solve.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, dft/zero.c, doc/f77_wisdom.f, doc/fftw3.texi, genfft-k7/algsimp.ml, genfft-k7/algsimp.mli, genfft/algsimp.ml, genfft/algsimp.mli, genfft/annotate.ml, genfft/annotate.mli, genfft/assoctable.ml, genfft/assoctable.mli, genfft/c.ml, genfft/c.mli, genfft/complex.ml, genfft/complex.mli, genfft/conv.ml, genfft/conv.mli, genfft/dag.ml, genfft/dag.mli, genfft/expr.ml, genfft/expr.mli, genfft/fft.ml, genfft/fft.mli, genfft/gen_athnotw.ml, genfft/gen_athtw.ml, genfft/gen_conv.ml, genfft/gen_hc2hc.ml, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_notw_c.ml, genfft/gen_r2hc.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twiddle_c.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/littlesimp.ml, genfft/littlesimp.mli, genfft/magic.ml, genfft/monads.ml, genfft/number.ml, genfft/number.mli, genfft/oracle.ml, genfft/oracle.mli, genfft/schedule.ml, genfft/schedule.mli, genfft/simd.ml, genfft/simd.mli, genfft/simdmagic.ml, genfft/to_alist.ml, genfft/to_alist.mli, genfft/trig.ml, genfft/trig.mli, genfft/twiddle.ml, genfft/twiddle.mli, genfft/unique.ml, genfft/unique.mli, genfft/util.ml, genfft/util.mli, genfft/variable.ml, genfft/variable.mli: great copyright update * TODO, tests/Makefile.am, tests/check.pl: threads in make check * threads/ct-dit.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c: fixed const warnings * threads/ct-dit.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c: make sure spawn_loop size > 1 (it has to be at least > 0 lest we crash, but > 1 is an optimization) 2003-03-15 Matteo Frigo * kernel/cycle.h: hpux seems to want machine/sys/inline.h as opposed to machine/inline.h. 2003-03-15 Steven G. Johnson * doc/fftw3.texi: Sourceforge is really SourceForge.net, and is run by VA * doc/fftw3.texi: comma * doc/fftw3.texi: fixed AMD company name * doc/fftw3.texi: minor changes * api/f77api.c, api/f77funcs.c: more emitter->read_char renaming * doc/fftw3.texi: more wisdom docs, noted wisdom utilities * doc/fftw3.texi: compound adjectives are hyphenated * doc/fftw3.texi: fftw does support another type of packed array via r2r * api/export-wisdom.c, api/f77api.c, api/f77funcs.c, api/fftw3.h, api/import-wisdom.c, doc/f77_wisdom.f, doc/fftw3.texi: write_char/read_char for export/import functions * threads/threads.c: comments 2003-03-15 Matteo Frigo * support/Makefile.codelets: Enabled randomized-cse * configure.ac: Changed to 3.0-beta1 * doc/fftw3.texi: First complete draft * api/fftw3.h, api/import-wisdom.c: EMITTER is a misnomer * doc/fftw3.texi: Revision, wisdom tutorial, acks. 2003-03-15 Steven G. Johnson * NEWS: noted OpenMP * threads/threads.c: comment * threads/threads.c: comments * threads/threads.c: reformatting * threads/threads.c: whoops * tests/bench.c, threads/api.c, threads/threads.c, threads/threads.h: some threads fixes, and added experimental semaphore (pre-thread-spawning) and Linux spinlock support * threads/f77funcs.c: whoops 2003-03-14 Steven G. Johnson * doc/fftw3.texi: added note that FFTW_PATIENT will disable threads if they are not beneficial * doc/fftw3.texi: made fftw_cleanup* more restrictive, in that we don't want to guarantee that previously created plans will still work (they won't, in the case of threaded plans and fftw_cleanup_threads), and there is no reason to provide such a guarantee anyway. 2003-03-14 Matteo Frigo * api/Makefile.am, api/version.c, kernel/Makefile.am, kernel/ifftw.h, kernel/version.c: Moved version.c from kernel/ into api/ * configure.ac: icc-7.0 requires -openmp * doc/Makefile.am: Ensure that one can do make dist given the distribution * doc/Makefile.am: Dist fftw3.pdf, not fftw.pdf * tests/bench.c: Support -onthreads=%d 2003-03-14 Steven G. Johnson * kernel/alloc.c: comment * threads/Makefile.am: whoops * doc/rfftwnd.fig: fftw_real is gone * doc/fftw3.texi: typos 2003-03-14 Matteo Frigo * api/fftw3.h, tests/bench.c: More BENCH_DOC strings * doc/fftw3.texi: Fixed xref's * doc/Makefile.am, doc/fftw3.texi, doc/rfftwnd.gif: Revised manual (esp. intro and tutorial), fixed texinfo hackery for figures. 2003-03-12 Steven G. Johnson * doc/fftw3.texi: redirect users from guru execute to advanced interface, if possible * doc/fftw3.texi: punctuation * doc/fftw3.texi: use correct heading level * doc/Makefile.am, doc/fftw3.texi: html generation * doc/equation-dft.gif, doc/equation-dht.gif, doc/equation-idft.gif, doc/equation-redft00.gif, doc/equation-redft01.gif, doc/equation-redft10.gif, doc/equation-redft11.gif, doc/equation-rodft00.gif, doc/equation-rodft01.gif, doc/equation-rodft10.gif, doc/equation-rodft11.gif: added equation GIFs * doc/fftw3.texi: punctuation * doc/fftw3.texi: added multi-dimensional transform definitions * doc/fftw3.texi: slight changes * doc/fftw3.texi: typo * doc/fftw3.texi: added 1d version of What FFTW Really Computes * doc/fftw3.texi: note in upgrading section about FFTW_PATIENT 2003-03-11 Steven G. Johnson * doc/fftw3.texi: added cycle-counter section * TODO: more ideas 2003-03-10 Steven G. Johnson * dft/indirect.c, rdft/indirect.c: noted that indirect should probably be merged with rank-geq2, to make a rank-split solver 2003-03-07 Steven G. Johnson * doc/fftw3.texi: added non-Unix installation instructions * doc/fftw3.texi: also talk about stack alignment with SSE/SSE2 * doc/fftw3.texi: made warning more dire * doc/fftw3.texi: fix * doc/fftw3.texi: number * doc/fftw3.texi: fix * doc/fftw3.texi: minor * doc/fftw3.texi: minor fix * doc/fftw3.texi: cross-ref * doc/fftw3.texi: minor * doc/fftw3.texi: more installation manual * doc/fftw3.texi: GNU-lly correct * doc/fftw3.texi: started installation section * configure.ac, kernel/timer.c: added --without-cycle-counter option as a last resort * kernel/cycle.h: macros with () arguments were only standardized in C99, and we don't need them anyway * doc/fftw3.texi: wording * doc/fftw3.texi: parallelism * doc/fftw3.texi: additions to upgrading chapter * doc/fftw3.texi: noted additional humility of FFTW 3 wisdom * doc/fftw3.texi: renaming * doc/fftw3.texi: added placeholder for wisdom reference * doc/fftw3.texi: wrote upgrading chapter 2003-03-06 Steven G. Johnson * doc/fftw3.texi: slight change * doc/fftw3.texi: placeholder for upgrade chapter * tools/fftw-wisdom.c: whoops * tools/fftw_wisdom.1.in: strengthed warning about time * tools/fftw_wisdom.1.in: noted -t in example * threads/f77api.c: pay attention to WINDOWS_F77_MANGLING * doc/fftw3.texi: punctuation * doc/fftw3.texi: index * doc/fftw3.texi: documented C++ usage * doc/fftw3.texi: got rid of overfull hbox TeX warnings * doc/fftw3.texi: whoops * doc/fftw3.texi: noted fftw_iodim split for Fortran guru interface * doc/fftw3.texi: added guru reference * doc/fftw3.texi: minor * doc/fftw3.texi: use @r{...} for comment text in code examples 2003-03-05 Steven G. Johnson * simd/sse.c: eliminate warning * configure.ac, dft/simd/Makefile.am, dft/simd/codelets/Makefile.am, kernel/align.c, simd/Makefile.am: SIMD_CFLAGS only for simd code 2003-03-05 Matteo Frigo * doc/fftw3.texi: Minor changes. 2003-03-05 Steven G. Johnson * api/f77api.c, configure.ac: cross-compiling with MinGW can't detect f77 mangling, so add an option to use what seems to be the most common styles * libbench2/util.c: comment * libbench2/util.c: we only use our-malloc-16 on machines where size_t == uintptr_t, so don't bother doing the right thing with the benchmark * libbench2/util.c: support WITH_OUR_MALLOC16 2003-03-04 FFTW Maintainers * configure.ac: automatically add -msse etcetera for --enable-sse etcetera * tools/fftw-wisdom.c: got rid of const warning * libbench2/problem.c: missing header 2003-03-04 Steven G. Johnson * doc/fftw3.texi: fixes * api/import-system-wisdom.c: whoops * doc/fftw3.texi: started guru reference * api/fftw3.h: use same FFTW_IODIM between precisions * doc/fftw3.texi: renamed section * doc/fftw3.texi: no need for "advanced" in subheadings * doc/fftw3.texi: typo * doc/fftw3.texi: finished advanced interface * doc/fftw3.texi: more advance interface docs * api/import-system-wisdom.c: fail for win32 2003-03-03 FFTW Maintainers * configure.ac: shortened help string * doc/fftw3.texi: fixed cross-refs * api/fftw3.h, api/mapflags.c, doc/fftw3.texi, tests/bench.c: FFTW_POSSIBLY_UNALIGNED -> simpler FFTW_UNALIGNED in API, added bench option * kernel/alloc.c: whoops * kernel/alloc.c: noted assumption * configure.ac, kernel/alloc.c: provide our own malloc16 routine because of Windows lossage 2003-03-03 Steven G. Johnson * doc/fftw3.texi: capitalization * doc/fftw3.texi: whoops * doc/fftw3.texi: vertical skip looks better than indenting for setting off short paragraphs 2003-03-03 Matteo Frigo * configure.ac, dft/simd/codelets/Makefile.am: Removed franz-mode. Automake was distributing franz files whether franz mode was enabled or not. 2003-03-03 Steven G. Johnson * doc/fftw3.texi: made output boundary conditions more prominent; they are important, because they make the different transform types inequivalent in parity * doc/fftw3.texi: clarification * doc/fftw3.texi: typo * doc/fftw3.texi: started advanced reference * doc/fftw3.texi: r2r reference * doc/fftw3.texi: workaround for info formatting bug * doc/fftw3.texi: noted lack of fftw_malloc in Fortran * doc/fftw3.texi: parallelism * doc/fftw3.texi: whoops * doc/fftw3.texi: r2c/c2r reference * doc/fftw3.texi: table of contents was being included twice * doc/fftw3.texi: minor changes * doc/fftw3.texi: started reference section * doc/Makefile.am: whoops * doc/fftw3.texi: started ref. section 2003-03-02 Steven G. Johnson * api/fftw3.h, api/flops.c: fftw_flops takes const plan * doc/fftw3.texi: typo * doc/fftw3.texi: added "Wisdom of Fortran?" section * doc/f77_wisdom.f: typo * doc/f77_wisdom.f: wording * doc/f77_wisdom.f: added comments * doc/f77_wisdom.f: added example file * tests/bench.c: don't print out READ WISDOM unless we have * kernel/scan.c: EOF is not a space 2003-03-02 Matteo Frigo * kernel/ifftw.h: Turn on inline by default * genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_notw_c.ml, genfft/gen_r2hc.ml, kernel/ifftw.h: Optionally inline loop in notw codelets 2003-03-02 Steven G. Johnson * doc/fftw3.texi: updated nodes * doc/fftw3.texi: wrote most of Fortran chapter * doc/fftw3.texi: citation * doc/fftw3.texi: added parallel FFTW chapter * doc/fftw3.texi: typo * TODO: added inlining to TODO * CONVENTIONS: added K * dft/zero.c, kernel/trig1.c, rdft/generic.c, rdft/problem.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/reodft11e-radix2.c, reodft/rodft00e-r2hc.c: use K for constants * doc/fftw3.texi: fixed cross-ref * doc/fftw3.texi: whoops 2003-03-01 Steven G. Johnson * doc/fftw3.texi: cleanup * doc/fftw3.texi: "words of wisdom" by itself is a little too obscure * doc/fftw3.texi: re-added multi-dimensional array stuff * doc/fftw3.texi: added alignment section * reodft/reodft11e-r2hc-odd.c: shrunk code * reodft/reodft11e-r2hc-odd.c: slight compression * doc/fftw3.texi, reodft/reodft11e-radix2.c: style 2003-02-28 Steven G. Johnson * CONVENTIONS: noted not in API * CONVENTIONS: more updates * CONVENTIONS: slight updates * api/f77funcs.c, api/fftw3.h, api/print-plan.c, dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/dft.h, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/solve.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0-rdft2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft.h, rdft/rdft2-radix2.c, rdft/solve.c, rdft/solve2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc-odd.c, reodft/reodft11e-r2hc.c, reodft/reodft11e-radix2.c, reodft/rodft00e-r2hc.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: great const-ification of apply/solve and print * api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-r2r.c, api/execute.c, api/f77funcs.c, api/fftw3.h, doc/fftw3.texi: make fftw_execute take a const plan, to remind the user that it is re-entrant (or should be)... * doc/fftw3.texi: weakening * doc/fftw3.texi: note * doc/fftw3.texi: footnote about why DHT is provided * doc/fftw3.texi: index * doc/fftw3.texi: added DHT tutorial * doc/fftw3.texi: fixed O(n log n) * doc/fftw3.texi: whoops * doc/fftw3.texi: slight improvements * doc/fftw3.texi: addition * doc/fftw3.texi: clarification * doc/fftw3.texi: fix * doc/fftw3.texi: slight changes * doc/fftw3.texi: added R{E,O}DFTab tutorial 2003-02-27 Steven G. Johnson * doc/fftw3.texi: fixes * doc/fftw3.texi: slight change * doc/fftw3.texi: documented r2hc/hc2r * doc/fftw3.texi: minor changes * TODO: timed planner and unifying radix-2 butterfly loops are not critical for release * TODO: reodft/verify.c no longer exists * rdft/problem.c: optimization: REDFT00 of size 2 is same as R2HC * rdft/problem.c: R{E,O}DFT01 of size-1 is identity * reodft/reodft11e-r2hc-odd.c: minor simplification * reodft/reodft11e-r2hc-odd.c: fixed add count * reodft/reodft11e-r2hc-odd.c: whoops * reodft/reodft11e-r2hc-odd.c: another optimization * reodft/reodft11e-r2hc-odd.c, reodft/reodft11e-radix2.c: added op counts * reodft/reodft11e-r2hc-odd.c: cleanup * reodft/reodft11e-r2hc-odd.c: typo in comment * reodft/reodft11e-r2hc-odd.c: fixed comment * reodft/reodft11e-r2hc-odd.c: use E instead of R * reodft/reodft11e-r2hc-odd.c: more unrolling to eliminate if statements in loops, for speedups of 25-40% * reodft/reodft11e-r2hc-odd.c: some loop splitting to touch each element of output buf only once and eliminate some conditionals...speeds up by 30-40% 2003-02-26 Steven G. Johnson * reodft/reodft11e-r2hc-odd.c: comma * reodft/reodft11e-radix2.c: pointer to odd case * reodft/reodft11e-r2hc.c: precision -> accuracy (c.f. Kahan) * Makefile.am, libbench2/bench-user.h, libbench2/problem.c, tools/fftw-wisdom.c, tools/fftw_wisdom.1.in: added time limit for wisdom generation * reodft/reodft11e-r2hc-odd.c: caps * reodft/reodft11e-r2hc-odd.c: another note * reodft/reodft11e-r2hc-odd.c: note * configure.ac, kernel/alloc.c, kernel/ifftw.h, libbench2/bench-main.c, libbench2/bench.h, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-r2r.c, libbench2/verify-rdft2.c, libbench2/verify.c, libbench2/verify.h, reodft/Makefile.am, reodft/conf.c, reodft/reodft.h, reodft/reodft11e-r2hc-odd.c, reodft/reodft11e-r2hc.c, reodft/reodft11e-radix2.c, tests/bench.c: added new, more accurate (hopefully) reodft11 algorithms; added --disable-debug-malloc; added --impulse-accuracy-rounds=rounds flags to libbench2 for impulse-response accuracy tests 2003-02-23 Matteo Frigo * tools/Makefile.am: fftw_wisdom.1 is in $builddir, not $srcdir 2003-02-17 Steven G. Johnson * doc/fftw3.texi: pde * doc/fftw3.texi: consistent number * doc/fftw3.texi: started r2r doc * doc/rfftwnd.gif, doc/Makefile.am, doc/fftw3.texi, doc/rfftwnd.fig: rfftwnd 2003-02-15 Steven G. Johnson * doc/fftw3.texi: continued * doc/fftw3.texi: started r2c/c2r docs * libbench2/verify-r2r.c: added r{e,o}dft11 accuracy test * libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-r2r.c, libbench2/verify-rdft2.c, libbench2/verify.h: added more r2r accuracy checks 2003-02-15 Matteo Frigo * tools/Makefile.am: $< is a GNUism 2003-02-13 Steven G. Johnson * TODO: r2r test cases are in * TODO: added vector radix to TODO 2003-02-12 Steven G. Johnson * tools/fftw_wisdom.1.in: fixed cross-ref * tools/fftw_wisdom.1.in: shorter synopsis * tests/debug.h: obsolete * tests/dotens.c, tests/dotens2.c: removed old dotens * tests/verify-dft.c, tests/verify-lib.c, tests/verify-rdft.c, tests/verify-reodft.c, tests/verify.h: removed old verify files * tools/fftw-wisdom.c, tools/fftw_wisdom.1.in: disable threads support by default 2003-02-12 Matteo Frigo * tests/bench.c: Removed old test program 2003-02-12 Steven G. Johnson * tools/fftw-wisdom-to-conf.in: joke * tools/fftw-wisdom-to-conf.1, tools/fftw-wisdom-to-conf.in: add --help and --version, to be GNU-lly correct * tools/fftw_wisdom.1.in: whoops * tools/fftw-wisdom.c: better help * tools/fftw-wisdom-to-conf.1: comma * tools/fftw-wisdom-to-conf.1: formatting * configure.ac, tools/Makefile.am, tools/fftw-wisdom-to-conf.1, tools/fftw_wisdom.1.in: man pages for tools * tools/fftw-wisdom.c: added -V 2003-02-11 Steven G. Johnson * Makefile.am: added install-wisdom target * NEWS: another note * libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-r2r.c, libbench2/verify-rdft2.c, libbench2/verify.c, libbench2/verify.h: started r2r accuracy tests (only three kinds covered so far) * kernel/ifftw.h: silence warning 2003-02-11 Matteo Frigo * TODO: gcc bug is now avoided. * libbench2/Makefile.am, libbench2/bench-user.h, libbench2/mp.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-r2r.c, libbench2/verify-rdft2.c, libbench2/verify.c, libbench2/verify.h: Accuracy test 2003-02-10 Matteo Frigo * kernel/ifftw.h: There is no point in precomputing strides for the long-double code, as multiplication by sizeof(long double) cannot be folded into the addressing mode. This change also fixes the gcc-2.95 bug that causes miscompilation of certain codelets. 2003-02-10 Steven G. Johnson * tests/check.pl: added random r2r tests * reodft/reodft010e-r2hc.c: whoops, bugfix: missing stride for ro10 * api/mapflags.c: formatting * reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c: flop counts for reodft * libbench2/bench.h: declare aligned_main * rdft/dht-rader.c, rdft/rader-hc2hc.c: corrected rader op counts * TODO: punctuation * TODO: noted need for better estimator * NEWS: noted F77 api fix for g77 mangling incompatibility * api/Makefile.am: build f77 header file of constants from fftw3.h * TODO: updates * api/Makefile.am, api/f77api.c, api/x77.h, threads/Makefile.am, threads/f77api.c, threads/f77funcs.c: threads f77 api 2003-02-09 Steven G. Johnson * api/f77api.c, api/f77funcs.c: finished f77 serial api * api/f77api.c, api/f77funcs.c: added flops, slight cleanups 2003-02-09 Matteo Frigo * libbench2/aligned-main.c: Oops, forgot #include * libbench2/Makefile.am, libbench2/aligned-main.c, libbench2/bench-main.c, libbench2/main.c, tools/fftw-wisdom.c: Removed duplication of stack-alignment code 2003-02-09 Steven G. Johnson * tools/fftw-wisdom.c: allow - to read problems from stdin * tools/Makefile.am, tools/fftw-wisdom.c: added fftw-wisdom tool * tests/bench.c: elim. warning * tests/bench.c: destroy_input should not contaminate flags of other problems * ChangeLog: updated * rdft/rank-geq2.c, dft/rank-geq2.c, rdft/rank-geq2-rdft2.c: removed overzealous inplace check, which caused problems for rdft2 2003-02-09 Matteo Frigo * kernel/tensor.c: Consistent syntax for RNK_MINFTY tensors * kernel/tensor.c: lisply-correct tensor print. We no longer need to parse tensors. 2003-02-09 Steven G. Johnson * TODO: removed completed items * libbench2/verify-r2r.c: slight renaming * libbench2/problem.c, libbench2/verify-r2r.c: multi-dimensional r2r verifier * libbench2/verify-r2r.c: comments * libbench2/verify-r2r.c: slight simplification * libbench2/Makefile.am, libbench2/allocate.c, libbench2/bench-user.h, libbench2/mflops.c, libbench2/problem.c, libbench2/verify-r2r.c, libbench2/verify.c, libbench2/zero.c, tests/bench.c: added 1d r2r verifier (triple ugh) * tests/check.pl: added vector transforms to random tests * rdft/direct2.c: whoops * libbench2/problem.c: fixed interaction between dwims for sz/vecsz with rdft2 transforms * libbench2/bench-user.h, libbench2/problem.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-rdft2.c, libbench2/verify.h, tests/bench.c: added destroy_input flag/check * libbench2/Makefile.am, libbench2/allocate.c, libbench2/aset.c, libbench2/bench-user.h, libbench2/bench.h, libbench2/problem.c, libbench2/tensor.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify-rdft2.c, libbench2/verify.c, libbench2/verify.h, libbench2/zero.c, tests/bench.c, tests/check.pl, api/Makefile.am, api/dfthelp.c, api/extract-reim.c, api/plan-guru-dft-c2r.c, api/plan-many-dft-c2r.c: added rdft2 verifier 2003-02-08 Steven G. Johnson * rdft/rdft2-radix2.c: an additional check for in-place case 2003-02-07 Steven G. Johnson * rdft/rank0-rdft2.c: slight fix: hc2r constraints are mostly determined by sub-plan * rdft/rdft2-radix2.c: make radix2-dft inapplicable to in-place/split case (r == rio, iio >= rio + n/2+1 != r + 1) 2003-02-04 Matteo Frigo * kernel/planner.c, tests/hook.c: Allow plnr->hook to be 0 2003-02-04 Steven G. Johnson * libbench2/bench-user.h, libbench2/verify-dft.c, libbench2/verify.c: moved dft stuff into verify-dft * tests/hook.c: cruft * libbench2/bench-user.h, libbench2/problem.c, libbench2/verify.c, tests/bench.c, tests/hook.c: further unify libbench2 and paranoid verifiers 2003-02-02 Steven G. Johnson * api/import-wisdom-from-file.c: typo in comment 2003-02-01 Matteo Frigo * kernel/primes.c: Fixed p==2 case * kernel/primes.c: Incorporated new find_generator by Greg Dionne. * libbench2/getopt.c: Removed nonportable call to gettext() 2003-01-30 Matteo Frigo * kernel/ifftw.h: uintptr_t is in in openbsd 2003-01-29 Matteo Frigo * api/export-wisdom-to-string.c, api/export-wisdom.c, api/import-wisdom-from-file.c, api/mkprinter-file.c, kernel/debug.c, kernel/ifftw.h, kernel/planner.c, kernel/print.c, kernel/scan.c, tests/bench.c: Huge speedups in wisdom I/O. * kernel/planner.c: Added appropriate warning against likely future bug. * kernel/planner.c: Don't attempt to remove bogus wisdom entries. 2003-01-28 Matteo Frigo * kernel/planner.c: Fixed a couple of very very very nasty bugs---pointers became invalid after the hash table was relocated. * tests/bench.c: Read wisdom at can_do() time, otherwise wisdom is destroyed. * kernel/planner.c: More conservative inheritance of blessings * dft/problem.c: Print the same info as it is hashed * tests/check.pl: Print name of executable when FAILURE 2003-01-27 Matteo Frigo * kernel/ifftw.h, kernel/planner.c: New NO_SEARCH planner flag, which avoids searching altogether. A wisdom entry must lead to a NO_SEARCH-grade plan, or else the wisdom entry is bogus. * libbench2/verify-lib.c: Use cosl()/sinl() when appropriate 2003-01-26 Matteo Frigo * kernel/planner.c, libbench2/problem.c, libbench2/speed.c, libbench2/verify.c: Use null pointers when estimating. The estimator should never time anything. 2003-01-26 Steven G. Johnson * api/f77api.c: note * api/Makefile.am, api/f77api.c, api/f77funcs.c, configure.ac: support multiple mangling schemes with g77 * tests/check.pl: fixed verbose, made random tests only use selected rank, use rank <= 4, fixed final flush_problems call * tests/check.pl: fixed typo (count instead of maxcount) * configure.ac: hypot is no longer used * configure.ac, kernel/ifftw.h: check for _alloca (MSVC) * kernel/alloc.c: slight fix in assert 2003-01-26 Matteo Frigo * libbench2/problem.c, libbench2/speed.c, libbench2/verify.c, tests/bench.c: Allocate problem in all cases--- can_do may need correct pointers. * tests/bench.c, tests/check.pl: Nastier checks * kernel/ifftw.h, kernel/plan.c, kernel/planner.c: X(use_plan) is a relic. * tests/Makefile.am: Print full pathname of the bench executable, so that I don't get confused when running multiple tests for different configurations. * libbench2/bench-main.c, libbench2/bench-user.h, tests/bench.c: Split done() into done() and cleanup(), in order to test multiple problems with the same planner from the command line. * kernel/alloc.c: Improved readability 2003-01-26 Steven G. Johnson * kernel/alloc.c: comment * kernel/alloc.c: added macos9 mpallocatealigned function 2003-01-25 Steven G. Johnson * kernel/alloc.c: sometimes __APPLE__ is defined instead of __MACOSX__ * kernel/alloc.c: macos x malloc is already 16-byte aligned 2003-01-25 Matteo Frigo * kernel/ifftw.h: Include because uintptr_t is defined there on solaris. * libbench2/Makefile.am, libbench2/getopt1.c: Oops---forgot getopt_long * configure.ac: Include default includes when checking for uintptr_t. (Otherwise solaris breaks.) * tests/Makefile.am: distribute check.pl * tests/check.pl: Check split format, too. * tests/Makefile.am, tests/check.pl: New tests, added make check 2003-01-23 Matteo Frigo * tests/check.pl: More tests 2003-01-22 Matteo Frigo * libbench2/problem.c, api/mktensor-iodims.c, api/mktensor-rowmajor.c: Deal with rnk(sz)=-infinity 2003-01-21 Matteo Frigo * TODO: Crazy idea * tests/check.pl: Test program, still barely worthy of the name. 2003-01-20 Matteo Frigo * libbench2/problem.c: Stylistic changes * api/Makefile.am, api/fftw3.h, api/flops.c, tests/bench.c: Implemented flops api 2003-01-19 Steven G. Johnson * libbench2/problem.c: cleanup * libbench2/problem.c: 'v' syntax now defaults to an 'internal' (stride 1) vector, which is a more interesting case and corresponds more closely to the intuitive notion of a 'vector' transform, while '*' does the old 'external' (stride n) vector * libbench2/problem.c: removed '/' overloading * libbench2/problem.c: get rid of '*' and ',' synonyms for 'x' in problem parser; there's no need to clutter the namespace with syntax we never use 2003-01-19 Matteo Frigo * kernel/planner.c: Signed/unsigned fixes. * libbench2/bench-user.h, libbench2/verify-dft.c, libbench2/verify.c, libbench2/verify.h, tests/bench.c, tests/hook.c: Test split arrays. 2003-01-19 Steven G. Johnson * doc/fftw3.texi: clarification * doc/fftw3.texi: caps * doc/fftw3.texi: brackets * doc/fftw3.texi: quote * doc/fftw3.texi: referencing * doc/fftw3.texi: fix * doc/fftw3.texi: slight change 2003-01-19 Matteo Frigo * libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify.h, tests/bench.c: Print errors when --verify. 2003-01-19 Steven G. Johnson * doc/fftw3.texi: improved description, noted that FFTW_ESTIMATE does not destroy arrays * api/fftw3.h: FFTW_DEFAULTS isn't really needed * api/fftw3.h, doc/fftw3.texi: added FFTW_MEASURE synonym for FFTW_DEFAULTS * kernel/alloc.c: slight change 2003-01-19 Matteo Frigo * tests/bench.c: Clearer name * api/fftw3.h, libbench2/tensor.c, tests/bench.c: Completed dft api test 2003-01-19 Steven G. Johnson * doc/fftw3.texi: index * doc/fftw3.texi: fix * doc/fftw3.texi: parallel structure * doc/fftw3.texi: fix * doc/fftw3.texi: joke * doc/fftw3.texi: recommendation to read tutorial in-order * doc/fftw3.texi: expanded outline * doc/fftw3.texi: clarification * doc/fftw3.texi: draft complex-dft tutorial 2003-01-18 Matteo Frigo * libbench2/allocate.c, libbench2/bench-main.c, libbench2/bench-user.h, libbench2/bench.h, libbench2/can-do.c, libbench2/dotens2.c, libbench2/info.c, libbench2/problem.c, libbench2/report.c, libbench2/speed.c, libbench2/timer.c, libbench2/util.c, libbench2/verify.c, libbench2/verify.h, libbench2/zero.c, tests/Makefile.am, tests/bench.c, tests/hook.c: Paranoid mode is back. Fixed dwim to do what I mean. 2003-01-18 Steven G. Johnson * doc/fftw3.texi: started tut. 2003-01-18 Matteo Frigo * libbench2/allocate.c, libbench2/bench-user.h, libbench2/bench.h, libbench2/can-do.c, libbench2/dotens2.c, libbench2/mflops.c, libbench2/problem.c, libbench2/report.c, libbench2/speed.c, libbench2/tensor.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify.c, libbench2/verify.h, libbench2/zero.c, tests/bench.c: Great renaming, so that we can include both bench-user.h and ifftw.h to implement the paranoid-mode hook. * libbench2/bench-user.h, libbench2/problem.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify.c, libbench2/verify.h: Trying to tweak the verifier so that I can use it in bench.c for paranoid mode * tests/bench.c: Added stride_factor for complex arrays. * tests/bench.c: can_do now calls the planner. * api/plan-guru-dft.c, tests/bench.c: Call guru api in bench.c * libbench2/bench.h, libbench2/zero.c: Fixed prototype. * api/api.h, api/apiplan.c, api/fftw3.h, api/mapflags.c, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c, kernel/ifftw.h: Attempt to make the signed/unsigned use of flags consistent. * libbench2/Makefile.am, libbench2/bench-main.c, libbench2/bench-user.h, libbench2/useropt.c, tests/bench.c: Implemented useropt. * api/mapflags.c: The first map_flags pass must be transitive, i.e., always use the latest flags value as opposed to the original value. (I think.) * tests/Makefile.am, libbench2/Makefile.am, libbench2/bench-user.h, libbench2/dotens2.c, libbench2/tensor.c, libbench2/verify-dft.c, libbench2/verify-lib.c, libbench2/verify.c, libbench2/verify.h: Started working on verifier 2003-01-17 Steven G. Johnson * api/fftw3.h, threads/api.c, threads/threads.c, threads/threads.h: added X(threads_cleanup) 2003-01-17 Matteo Frigo * libbench2/allocate.c, libbench2/tensor.c: Use C style for upper and lower array bounds. Free tensors properly. * libbench2/problem.c: Fixed ambiguous syntax * libbench2/problem.c: Parse minus sign, bugfixes * Makefile.am, configure.ac, libbench2/Makefile.am, libbench2/allocate.c, libbench2/bench-main.c, libbench2/bench-user.h, libbench2/bench.h, libbench2/can-do.c, libbench2/caset.c, libbench2/getopt-utils.c, libbench2/getopt.c, libbench2/getopt.h, libbench2/info.c, libbench2/main.c, libbench2/mflops.c, libbench2/ovtpvt.c, libbench2/pow2.c, libbench2/problem.c, libbench2/report.c, libbench2/speed.c, libbench2/tensor.c, libbench2/timer.c, libbench2/util.c, libbench2/verify.c, libbench2/zero.c, tests/Makefile.am, tests/bench.c: Skeleton libbench2 implemented (probably still buggy) * kernel/tensor4.c: Formatting 2003-01-17 FFTW Maintainers * doc/fftw3.texi: slight updates 2003-01-17 Steven G. Johnson * dft/vrank-geq1.c, kernel/buffered.c, kernel/ifftw.h, kernel/minmax.c, kernel/tensor4.c, rdft/buffered2.c, rdft/rdft2-inplace-strides.c, rdft/rdft2-tensor-max-index.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: eliminated obsolete uimin/uimax * Makefile.am, api/Makefile.am, api/configure.c, api/fftw3.h, api/plan-with-nthreads.c, tests/Makefile.am, tests/bench.c, threads/Makefile.am, threads/api.c, threads/threads.c: threads needs to have its own library, lest all programs linking to libfftw3.so need -lpthread * api/f77api.c: whoops * api/f77api.c: better name * api/f77api.c: added more functions 2003-01-16 Steven G. Johnson * kernel/ifftw.h: if 'long' is big enough, use it for mulmod in preference to 'long long' * configure.ac, kernel/align.c, kernel/ifftw.h: use uintptr_t for pointer alignment arithmetic 2003-01-16 Matteo Frigo * kernel/planner.c, kernel/print.c, kernel/tensor.c, kernel/twiddle.c, rdft/problem.c: More signed/unsigned cleanup * kernel/solvtab.c: null function pointers are technically nonportable * libbench/bench-main.c: Free short_options * kernel/alloc.c, kernel/ifftw.h, tests/bench.c: Oops, forgot STACK_FREE * kernel/alloc.c, kernel/ifftw.h: Do not require memalign() unless HAVE_SIMD 2003-01-16 Steven G. Johnson * kernel/alloc.c: MS VC++ _aligned_malloc * api/fftw3.h, kernel/alloc.c: added api fftw_malloc/free * api/map-r2r-kind.c: silence warning * tools/fftw-wisdom-to-conf.in: send error output to stderr 2003-01-15 Matteo Frigo * kernel/tensor7.c: Pure paranoia. * api/api.h, api/apiplan.c, api/configure.c, api/dfthelp.c, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-r2r.c, api/execute.c, api/export-wisdom-to-file.c, api/export-wisdom-to-string.c, api/export-wisdom.c, api/f77api.c, api/fftw3.h, api/forget-wisdom.c, api/import-system-wisdom.c, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, api/import-wisdom.c, api/map-r2r-kind.c, api/mapflags.c, api/mkprinter-file.c, api/mktensor-iodims.c, api/mktensor-rowmajor.c, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c, api/plan-with-nthreads.c, api/print-plan.c, api/rdft2-pad.c, api/the-planner.c, dft/buffered.c, dft/ct.c, dft/direct.c, dft/generic.c, dft/problem.c, dft/rader.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/Makefile.am, kernel/alloc.c, kernel/ifftw.h, kernel/planner.c, kernel/print.c, kernel/tensor.c, kernel/tensor9.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc.c, rdft/problem.c, rdft/problem2.c, rdft/rader-hc2hc.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, threads/dft-vrank-geq1.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Fixed formatting that was messed up by the conversion uint->int. Ensure that iodims etc are kosher. 2003-01-15 Steven G. Johnson * tools/fftw-wisdom-to-conf.in: added version stamp * tools/fftw-wisdom-to-conf.in: added warning * tools/Makefile.am: add fftw-wisdom-to-conf to BUILT_SOURCES * tools/fftw-wisdom-to-conf.in: added const * Makefile.am, configure.ac, tools/Makefile.am, tools/fftw-wisdom-to-conf.in: added wisdom-to-conf * kernel/planner.c: include type prefix in wisdom preamble * TODO: updates * tests/bench.c: check the_plan before printing 2003-01-15 Matteo Frigo * ChangeLog, api/api.h, api/apiplan.c, api/configure.c, api/dfthelp.c, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/execute-dft.c, api/execute-r2r.c, api/execute.c, api/export-wisdom-to-file.c, api/export-wisdom-to-string.c, api/export-wisdom.c, api/f77api.c, api/fftw3.h, api/forget-wisdom.c, api/import-system-wisdom.c, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, api/import-wisdom.c, api/map-r2r-kind.c, api/mapflags.c, api/mkprinter-file.c, api/mktensor-iodims.c, api/mktensor-rowmajor.c, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c, api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c, api/plan-with-nthreads.c, api/print-plan.c, api/rdft2-pad.c, api/the-planner.c, configure.ac, dft/buffered.c, dft/codelet-dft.h, dft/codelets/n.c, dft/codelets/t.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/direct.c, dft/generic.c, dft/indirect.c, dft/k7/k7.c, dft/problem.c, dft/rader-omega.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, dft/zero.c, genfft/gen_hc2hc.ml, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_notw_c.ml, genfft/gen_r2hc.ml, genfft/gen_twiddle.ml, genfft/gen_twiddle_c.ml, genfft/gen_twidsq.ml, kernel/align.c, kernel/buffered.c, kernel/ct.c, kernel/hash.c, kernel/iabs.c, kernel/ifftw.h, kernel/md5-1.c, kernel/md5.c, kernel/minmax.c, kernel/ops.c, kernel/pickdim.c, kernel/planner.c, kernel/primes.c, kernel/print.c, kernel/rader.c, kernel/scan.c, kernel/tensor.c, kernel/tensor1.c, kernel/tensor2.c, kernel/tensor4.c, kernel/tensor5.c, kernel/tensor7.c, kernel/trig.c, kernel/twiddle.c, libbench/acopy.c, libbench/allocate.c, libbench/ascale.c, libbench/aset.c, libbench/bench-user.h, libbench/bench.h, libbench/caadd.c, libbench/cacopy.c, libbench/cascale.c, libbench/caset.c, libbench/casub.c, libbench/copy-c2h-1d-fftpack.c, libbench/copy-c2h-1d-halfcomplex.c, libbench/copy-c2h-1d-packed.c, libbench/copy-c2h-1d-unpacked-ri.c, libbench/copy-c2h-unpacked.c, libbench/copy-c2r-packed.c, libbench/copy-c2r-unpacked.c, libbench/copy-c2ri.c, libbench/copy-h2c-1d-fftpack.c, libbench/copy-h2c-1d-halfcomplex.c, libbench/copy-h2c-1d-packed.c, libbench/copy-h2c-1d-unpacked-ri.c, libbench/copy-h2c-unpacked.c, libbench/copy-r2c-packed.c, libbench/copy-r2c-unpacked.c, libbench/copy-ri2c.c, libbench/getopt-utils.c, libbench/getopt.c, libbench/log2.c, libbench/mp.c, libbench/pow2.c, libbench/prime.c, libbench/problem.c, libbench/timer.c, libbench/verify.c, rdft/buffered.c, rdft/buffered2.c, rdft/codelet-rdft.h, rdft/codelets/hc2r.c, rdft/codelets/hfb.c, rdft/codelets/r2hc.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/hc2hc.h, rdft/indirect.c, rdft/problem.c, rdft/problem2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0-rdft2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft.h, rdft/rdft2-inplace-strides.c, rdft/rdft2-radix2.c, rdft/rdft2-tensor-max-index.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, simd/3dnow.c, simd/sse.c, simd/sse2.c, tests/bench.c, tests/dotens.c, tests/dotens2.c, tests/trigtest.c, tests/verify-dft.c, tests/verify-lib.c, tests/verify-rdft.c, tests/verify-reodft.c, tests/verify.h, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/threads.c, threads/threads.h, threads/vrank-geq1-rdft2.c: Eliminated those unsigned values that would break LP64 machines. 2003-01-14 Steven G. Johnson * kernel/primes.c: comments 2003-01-14 Matteo Frigo * dft/generic.c, rdft/generic.c: Oops * dft/generic.c, rdft/generic.c: int/uint confusion 2003-01-14 Steven G. Johnson * doc/fftw3.texi: updated introduction and some organization * api/f77api.c: whoops * Makefile.am: newline * libbench/timer.c: added win32 timer * libbench/util.c: sync with kernel/alloc.c * api/f77api.c: handle missing F77_FUNC_ 2003-01-13 Steven G. Johnson * api/f77api.c: used fint instead of int to make Fortran integer type easier to change * api/f77api.c: slight abbreviation * api/Makefile.am, api/api.h, api/f77api.c, api/fftw3.h, api/mktensor-rowmajor.c, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-dft.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c, api/rdft2-pad.c, kernel/Makefile.am, kernel/ifftw.h, kernel/tensor3.c, tests/bench.c: the great lengthening, part I: int -> long in api; mv mktensor-rowmajor to api * configure.ac: long types 2003-01-13 Matteo Frigo * kernel/print.c, kernel/problem.c, kernel/rader.c, kernel/scan.c, kernel/solver.c, kernel/stride.c, kernel/tensor.c, kernel/twiddle.c, rdft/buffered.c, rdft/buffered2.c, rdft/dht-rader.c, rdft/generic.c, rdft/problem.c, rdft/problem2.c, rdft/rader-hc2hc.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/verify-dft.c, tests/verify-rdft.c, tests/verify-reodft.c, threads/dft-vrank-geq1.c, threads/rdft-vrank-geq1.c, threads/threads.c, threads/vrank-geq1-rdft2.c, api/apiplan.c, api/export-wisdom-to-string.c, api/f77api.c, api/map-r2r-kind.c, api/plan-guru-r2r.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/plan-many-r2r.c, api/rdft2-pad.c, dft/buffered.c, dft/generic.c, dft/problem.c, dft/rader-omega.c, dft/rader.c, kernel/alloc.c, kernel/ifftw.h, kernel/plan.c, kernel/planner.c: Renamed fftw_malloc -> MALLOC, X(free) -> X(ifree), X(free0) -> X(ifree0), non_fftw_malloc -> NATIVE_MALLOC 2003-01-13 Steven G. Johnson * api/Makefile.am, api/f77api.c: added beginning of Fortran interface * configure.ac: add fortran mangling check * api/Makefile.am, api/execute-r2r.c, api/fftw3.h, api/plan-guru-r2r.c: added guru r2r interface * api/fftw3.h, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c: whoops * api/Makefile.am, api/fftw3.h, api/map-r2r-kind.c, api/plan-many-r2r.c, api/plan-r2r-1d.c, api/plan-r2r-2d.c, api/plan-r2r-3d.c, api/plan-r2r.c: added r2r planner * configure.ac: more long-double checks * kernel/planner.c: slight regrouping * kernel/planner.c: added joke * api/Makefile.am, api/api.h, api/mktensor-rowmajor-pad.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c, api/rdft2-pad.c: simplified rdft2 padding * api/fftw3.h: added comment 2003-01-12 Steven G. Johnson * tests/bench.c: use latest api * api/fftw3.h, api/plan-dft-1d.c, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c, api/plan-dft.c: nembed should only be in advanced (many) interface, not basic interface...only a handful of people over the years have ever requested that functionality. * api/fftw3.h, api/mapflags.c: impatient is default; generalize mapping functions using xor trick * api/plan-dft-r2c-3d.c, api/mktensor-rowmajor-pad.c, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c: use NULL nembed to signal padding * api/plan-many-dft.c: accept NULL nembed * api/Makefile.am, api/execute-dft-c2r.c, api/execute-dft-r2c.c, api/fftw3.h: added execute-dft-r2c/c2r * api/plan-dft.c: don't need dft.h * api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c: tensors are compressed in the problem, duh * kernel/alloc.c: noted that posix_memalign bug is now fixed, thanks to bug report by yours truly 2003-01-12 Matteo Frigo * api/plan-dft-3d.c, api/plan-dft-c2r-3d.c, api/plan-dft-r2c-3d.c: Bug: n[3] instead of n[2]. Bug was propagated by copy-and-paste. Grrr... * api/plan-dft.c: Express plan_dft() in terms of plan_many_dft() 2003-01-12 Steven G. Johnson * api/plan-guru-dft-c2r.c, api/plan-guru-dft-r2c.c, api/plan-guru-dft.c: whoops 2003-01-12 Matteo Frigo * Makefile.am, configure.ac, doc/Makefile.am, doc/fftw3.texi, genfft-k7/vK7Optimization.ml: Manual skeleton. 2003-01-12 Steven G. Johnson * api/Makefile.am, api/fftw3.h: added r2c/c2r guru api * api/plan-many-dft-c2r.c: FFTW_DESTROY_INPUT is default for c2r transforms * api/Makefile.am, api/fftw3.h, api/plan-dft-c2r-1d.c, api/plan-dft-c2r-2d.c, api/plan-dft-c2r-3d.c, api/plan-dft-c2r.c, api/plan-dft-r2c-1d.c, api/plan-dft-r2c-2d.c, api/plan-dft-r2c-3d.c, api/plan-dft-r2c.c: added more of r2c/c2r api * api/fftw3.h, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c: r2c doesn't have adjustible sign * TODO: note that copyright year is out of date * api/fftw3.h: updated api for r2c * api/mktensor-rowmajor-pad.c: removed annoying nophys == niphys case * api/Makefile.am, api/api.h, api/mktensor-rowmajor-pad.c, api/plan-many-dft-c2r.c, api/plan-many-dft-r2c.c: added basic r2c/c2r planner * api/plan-many-dft.c: dist should be in terms of complex values * api/fftw3.h: added plan-with-nthreads * api/Makefile.am, api/plan-with-nthreads.c: added function to set nthr 2003-01-11 Steven G. Johnson * api/fftw3.h: slight cleanup * api/mktensor-iodims.c: whoops * kernel/scan.c: maxlen is maximum string length, not including null termination * kernel/planner.c: imprt reverts hashtable on failure * api/fftw3.h: slight move * api/fftw3.h: stdio.h should be inlcuded outside of extern "C" * api/Makefile.am, api/api.h, api/fftw3.h: added guru planner API * api/fftw3.h: added FFTW_FORWARD/BACKWARD * api/Makefile.am, api/fftw3.h, api/plan-many-dft.c: added plan_many_dft * kernel/tensor3.c: indenting 2003-01-11 Matteo Frigo * tests/bench.c: Final \n * kernel/debug.c: Do not compile if not defined(FFTW_DEBUG), in order to avoid unused code in the shared library. * api/Makefile.am, api/api.h, api/export-wisdom-to-file.c, api/fftw3.h, api/mkprinter-file.c, api/print-plan.c, tests/bench.c: Implemented print_plan() 2003-01-11 Steven G. Johnson * api/apiplan.c, api/fftw3.h, tests/bench.c: changed the OOP-like plan_destroy to the more-grammatical destroy_plan * api/execute-dft.c, api/Makefile.am, api/fftw3.h: added guru execute_dft * api/export-wisdom-to-string.c: allow for malloc errors in wisdom string, since non-fftw-malloc * api/the-planner.c: cleanup should reset plnr to zero so that fftw can be restarted * api/fftw3.h, api/mapflags.c: NO_UGLY is an internal planner flag 2003-01-11 Matteo Frigo * api/plan-dft-1d.c, tests/bench.c: Written 1d api in terms of generic n-d api. The code is less compact but easier to test * tests/verify-lib.c, tests/verify-reodft.c, api/export-wisdom-to-file.c, api/export-wisdom-to-string.c, api/fftw3.h, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, kernel/alloc.c, kernel/assert.c, kernel/debug.c, kernel/ifftw.h, kernel/print.c, kernel/scan.c, tests/bench.c: Added wisdom to header file, made scanners/printer static. stdio.h no longer needed in fftw.h, removed. Probably the printer_file should be reintroduced in a separate file if we ever want to print plans... * api/Makefile.am, api/apiplan.c, api/fftw3.h, api/plan-dft-2d.c, api/plan-dft-3d.c, api/plan-dft.c, tests/bench.c: Implemented more APIs * api/fftw3.h, api/the-planner.c, tests/bench.c: Added cleanup() to API * api/api.h, api/apiplan.c, api/fftw3.h, dft/buffered.c, dft/ct.c, dft/generic.c, dft/indirect.c, dft/rader.c, dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/plan.c, kernel/planner.c, libbench/bench-user.h, libbench/bench.h, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/generic.c, rdft/hc2hc.c, rdft/indirect.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0-rdft2.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/Makefile.am, tests/bench.c, threads/dft-vrank-geq1.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Started new bench.c. I had to rename plan_destroy -> plan_destroy_internal to avoid conflicts with API 2003-01-11 Steven G. Johnson * api/Makefile.am, api/export-wisdom.c, api/import-system-wisdom.c, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, api/import-wisdom.c: fix types * api/export-wisdom-to-string.c: whoops * api/Makefile.am, api/export-wisdom-to-file.c, api/export-wisdom-to-string.c, api/export-wisdom.c, api/forget-wisdom.c, api/import-wisdom-from-file.c, api/import-wisdom-from-string.c, api/import-wisdom.c, kernel/Makefile.am, kernel/ifftw.h, kernel/printers.c, kernel/scanners.c: added wisdom api * api/mapflags.c: grammar * api/mapflags.c: slight change * api/fftw3.h, api/mapflags.c: implemented api/mapflags * kernel/ifftw.h: IMPATIENT is an api issue 2003-01-10 Steven G. Johnson * api/the-planner.c: removed un-needed headers * api/the-planner.c: mkplanner initializes nthr to 1 already 2003-01-09 Steven G. Johnson * api/fftw3.h: boilerplate * rdft/vrank-geq1.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c: fold vecloop into r{e,o}dft apply function to share buffer, etcetera * tests/verify-reodft.c: whoops, bugfix in impulse test for vecn > 1 * rdft/hc2hc-buf.c: bugfix, grr * rdft/codelet-rdft.h: fixed signed-ness enum problem 2003-01-09 Matteo Frigo * kernel/md5-1.c: Explicit cast * api/Makefile.am, api/api.h, api/apiplan.c, api/configure.c, api/fftw3.h, api/the-planner.c: Added configure_planner(). mkplan() behaves properly when plan is null. * api/Makefile.am, api/api.h, api/apiplan.c, api/execute.c, api/fftw3.h, api/mapflags.c, api/plan-dft-1d.c, tests/bench.c: More API work * Makefile.am, api/Makefile.am, api/api.h, api/dfthelp.c, api/fftw3.h, api/plan-dft-1d.c, api/the-planner.c, configure.ac, kernel/ifftw.h, kernel/trig.c, tests/Makefile.am: First skeleton of API infrastructure 2003-01-09 Steven G. Johnson * rdft/rdft2-tensor-max-index.c: unsigned strikes again * rdft/Makefile.am, rdft/problem2.c, rdft/rdft2-inplace-strides.c, rdft/rdft2-tensor-max-index.c, rdft/vrank-geq1-rdft2.c: put rdft2_inplace_strides and rdft2_tensor_max_index in their own files for tighter linking * rdft/rank-geq2-rdft2.c, rdft/rdft.h, rdft/vrank-geq1-rdft2.c: added rdft2_tensor_max_index...incorrect use of tensor_max_index was preventing proper loop ordering for rnk > 2 rdft2 * rdft/rank-geq2-rdft2.c: arbitrary spltrnk in rdft2 rank-geq2 * tests/bench.c: don't mention wisdom when non-verbose * dft/problem.c, rdft/problem.c, rdft/problem2.c: bug fix: printing %T should pass tensor *, not tensor ** * reodft/rodft00e-r2hc.c, tests/verify-reodft.c: correct(?) normalization for rodft00 ... all of the even/odd transforms should be normalized according to the expanded'' DFT of ~twice the length * tests/verify-reodft.c: fixed tests for n=1 * tests/bench.c: fixed bug in vector tests for rdft(2) * rdft/problem2.c: fixed handling when first rnk-1 dimensions compress to nothing (ugh) * rdft/Makefile.am, rdft/conf.c, rdft/nop2.c, rdft/rank0-rdft2.c, rdft/rdft.h: fixed incorrect/missing rdft2 rank-0 handling * rdft/problem2.c: bug fix: for rnk > 1, must compress rnk-1 dims separately (ugh) 2003-01-08 Steven G. Johnson * configure.ac: added trailing newline * ChangeLog: updated * rdft/problem.c: got rid of compiler warning * tests/bench.c: whoops, test r2hc and not rodft00 by default * rdft/buffered.c, rdft/indirect.c, rdft/problem.c, rdft/rank-geq2.c, rdft/rdft.h, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/bench.c, tests/verify-reodft.c: got rid of real_n...use physical n everywhere in rdft; fixed rdft sz compression; fixed rodft00 verify bug 2003-01-08 Matteo Frigo * simd/Makefile.am, simd/sse-aux.c, simd/sse.c, simd/sse2-aux.c, simd/sse2.c: icc-6.0 bug workaround * kernel/ifftw.h, rdft/buffered2.c, rdft/rader-hc2hc.c, tests/bench.c: Reclaimed the fftw_real identifier, because I need it for the API * configure.ac: Use recommended AC_OUTPUT syntax * kernel/ifftw.h, tests/bench.c: Removed FFTW(foo) as a synonym for X(foo). This is an API issue. 2003-01-07 Steven G. Johnson * simd/sse2.c: get rid of warning 2003-01-07 Matteo Frigo * dft/Makefile.am, dft/codelet-dft.h, dft/codelet.h, dft/codelets/inplace/Makefile.am, dft/codelets/n.c, dft/codelets/standard/Makefile.am, dft/codelets/t.c, dft/dft.h, dft/simd/codelets/Makefile.am, dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, rdft/Makefile.am, rdft/codelet-rdft.h, rdft/codelet.h, rdft/codelets/hc2r.c, rdft/codelets/hc2r/Makefile.am, rdft/codelets/hfb.c, rdft/codelets/r2hc.c, rdft/codelets/r2hc/Makefile.am, rdft/rdft.h, support/Makefile.am, support/Makefile.codelets, support/codelet_prelude, support/codelet_prelude.dft, support/codelet_prelude.rdft: Renamed conflicting files */codelet.h into dft/codelet-dft.h and rdft/codelet-rdft.h 2003-01-07 Steven G. Johnson * ChangeLog: updated 2003-01-07 Matteo Frigo * simd/simd-3dnow.h, simd/simd-sse.h, simd/simd-sse2.h, simd/sse2.c: Silence warnings 2003-01-07 Steven G. Johnson * dft/rank-geq2.c, rdft/rank-geq2.c: fftw2 used spltrnk=1 2003-01-07 Matteo Frigo * dft/codelet.h, rdft/codelet.h, simd/simd-sse.h, simd/sse.c: Silence warning 2003-01-07 Steven G. Johnson * TODO: noted deficiency 2003-01-07 Matteo Frigo * rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: Strengthened conditions for a problem to be POSSIBLY_UNALIGNED * dft/vrank-geq1.c, kernel/align.c, kernel/ifftw.h: Strengthened conditions for a plan to be POSSIBLY_UNALIGNED 2003-01-05 Steven G. Johnson * TODO: added copyright todo * kernel/planner.c: modified comment * tests/verify-rdft.c: fixed comment * TODO, tests/verify-rdft.c: implemented rdft2 verify 2003-01-04 Steven G. Johnson * configure.ac: fix --enable-single 2002-10-23 Steven G. Johnson * threads/threads.c: slight fixes * threads/threads.c: typo 2002-10-01 Matteo Frigo * genfft/annotate.ml, genfft/annotate.mli, genfft/c.ml, genfft/genutil.ml, genfft/magic.ml: Experimental stuff 2002-09-28 Matteo Frigo * configure.ac, dft/simd/codelets/Makefile.am, genfft/gen_notw_c.ml, genfft/gen_twiddle_c.ml, genfft/genutil.ml: Experimental Franz mode 2002-09-26 Matteo Frigo * kernel/tensor.c: const-correct * dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/tensor7.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c: Reuse dimcmp routine for other purposes 2002-09-25 Matteo Frigo * dft/direct.c, kernel/ifftw.h, kernel/tensor.c, rdft/direct.c, rdft/direct2.c: Use tornk1 correctly. * rdft/rdft2-radix2.c: Hmm... I thought I had fixed this before... * dft/buffered.c, dft/rank0.c, kernel/tensor.c, rdft/buffered.c, rdft/buffered2.c, rdft/rank0.c: Collect more common idioms * dft/direct.c, rdft/direct.c, rdft/direct2.c: Still collecting common idioms... * dft/direct.c, rdft/direct.c, rdft/direct2.c: More garbage collection. * dft/buffered.c: More compact code * dft/buffered.c, dft/generic.c, dft/rader.c, kernel/alloc.c, kernel/ifftw.h, kernel/planner.c, kernel/stride.c, kernel/tensor.c, rdft/buffered.c, rdft/buffered2.c, rdft/dht-rader.c, rdft/generic.c, rdft/problem.c, rdft/rader-hc2hc.c: Collect common pattern if (foo) free(foo) ==> free0(foo) * dft/buffered.c, kernel/Makefile.am, kernel/buffered.c, kernel/ifftw.h, rdft/buffered.c, rdft/buffered2.c: Collect some common code in */buffered*.c 2002-09-24 Steven G. Johnson * rdft/problem.c, rdft/rdft.h: use STRUCT_HACK #define to determing rdft kind[] allocation * kernel/ifftw.h, kernel/planner.c: report total pcost of measured/estimated plans...epcost is especially useful to estimate the effects of various impatience flags on planning time for large transforms 2002-09-23 Matteo Frigo * kernel/Makefile.am, kernel/trig.c, kernel/trig1.c: Prevent unwanted inlining * kernel/ifftw.h, kernel/trig.c: Space compaction * kernel/Makefile.am, kernel/hash.c, kernel/ifftw.h, kernel/md5-1.c, kernel/planner.c, kernel/scan.c: Still reducing size 2002-09-22 Matteo Frigo * dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/ops.c, kernel/plan.c, rdft/buffered.c, rdft/buffered2.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Saved another 5KB by redesigning opcnt protocol. (gasp!) * dft/buffered.c, dft/direct.c, dft/indirect.c, dft/problem.c, dft/rank-geq2.c, kernel/Makefile.am, kernel/ifftw.h, kernel/tensor1.c, kernel/tensor4.c, kernel/tensor8.c, rdft/buffered.c, rdft/dft-r2hc.c, rdft/direct.c, rdft/hc2hc.c, rdft/indirect.c, rdft/problem.c, rdft/problem2.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c: More code compression * kernel/ifftw.h, kernel/solver.c: Smaller code size. * dft/Makefile.am, dft/dft.h, dft/rader-omega.c, dft/rader.c, rdft/rader-hc2hc.c: Started unification of rader * rdft/rdft2-radix2.c: Typo * dft/buffered.c, dft/ct.c, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/plan.c, kernel/problem.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc.c, rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, threads/dft-vrank-geq1.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Changed protocol for destroy_plan so as to save space. * dft/buffered.c, dft/ct.c, dft/generic.c, dft/indirect.c, dft/rader.c, dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/generic.c, rdft/hc2hc.c, rdft/indirect.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Introduced convenient function X(mkplan_d) * kernel/Makefile.am, kernel/md5-1.c, kernel/md5.c, kernel/tensor.c, kernel/tensor1.c, kernel/tensor2.c, kernel/tensor3.c, kernel/tensor4.c, kernel/tensor5.c, kernel/tensor7.c: Split tensor/md5 into separate files to allow independent linking and/or prevent undesidred inlining * rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/problem.c, rdft/problem2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft.h, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/debug.h, tests/dotens.c, tests/dotens2.c, tests/verify-dft.c, tests/verify-lib.c, tests/verify-rdft.c, tests/verify-reodft.c, tests/verify.h, dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/dft.h, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/problem.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, dft/zero.c, kernel/ifftw.h, kernel/tensor.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c: Treat all tensors as dynamically allocated objects. They were dynamically allocated in part anyway, so there is no point in complicating the object code with the clumsy calling conventions for by-value structs. 2002-09-21 Steven G. Johnson * kernel/ifftw.h: typo 2002-09-21 Matteo Frigo * tests/verify-lib.c: Avoid generating NaN when n = 0. * dft/dft.h, dft/problem.c, dft/rank-geq2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/hc2hc.c, rdft/problem.c, rdft/problem2.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft.h, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, threads/dft-vrank-geq1.c: Saved more. * threads/dft-vrank-geq1.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c, dft/buffered.c, dft/ct.c, dft/direct.c, dft/indirect.c, dft/nop.c, dft/problem.c, dft/rank-geq2.c, dft/vrank-geq1.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/pickdim.c, kernel/print.c, kernel/tensor.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/direct.c, rdft/hc2hc.c, rdft/indirect.c, rdft/nop.c, rdft/problem.c, rdft/problem2.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft.h, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/verify-dft.c, tests/verify-lib.c, tests/verify-rdft.c, tests/verify-reodft.c: Save 1200 bytes of object code. Do not pass structs by value whenever practical, because the calling protocol generates clumsy code. * rdft/dht-rader.c: Do not allocate buffers for rader omegas. Let the planner do it if necessary. * tests/verify-rdft.c, tests/verify-reodft.c: Check rank *before* reading kind[0], which may be undefined if rnk < 1 * dft/rader.c, rdft/rader-hc2hc.c: Second step towards rader unification. * dft/rader.c, kernel/Makefile.am, kernel/ifftw.h, kernel/rader.c, rdft/dht-rader.c, rdft/rader-hc2hc.c: First step towards unification of Rader code * dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, kernel/Makefile.am, kernel/ct.c, kernel/ifftw.h, kernel/planner.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/rdft-dht.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, threads/ct-dit.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c: Fix ugliness condition for cooley-tukey. 2002-09-20 Matteo Frigo * dft/rader.c, kernel/ifftw.h, rdft/dht-rader.c, rdft/rader-hc2hc.c: Removed RADER_MIN_GOOD and associated machinery * rdft/dht-r2hc.c: Proper cast * kernel/planner.c: Typo * dft/generic.c, kernel/ifftw.h, rdft/dht-rader.c, rdft/generic.c, rdft/rdft-dht.c, tests/bench.c: Implemented NO_LARGE_GENERIC 2002-09-19 Matteo Frigo * rdft/dht-r2hc.c, kernel/ifftw.h: Consistent macroization of NO_DHT_R2HC * kernel/ifftw.h, kernel/planner.c, rdft/dht-r2hc.c, tests/bench.c: NO_DHT_R2HC is a planner flag, otherwise the EXHAUSTIVE planner loops. * kernel/ifftw.h, kernel/planner.c: Resurrected NO_EXHAUSTIVE 2002-09-18 Steven G. Johnson * threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: au revoir, score() * tests/bench.c, tests/verify-reodft.c: eliminated unused * kernel/planner.c: capitalize and parenthesize SUBSUMES * kernel/ifftw.h: comment 2002-09-18 Matteo Frigo * kernel/ifftw.h, kernel/planner.c: Use flags from wisdom if wisdom is applicable. * rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, tests/bench.c, dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/planner.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/rader-hc2hc.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rank0.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c: Removed score() machinery * kernel/planner.c: Revised planner hack * simd/simd-altivec.h: Fix warning 2002-09-17 Matteo Frigo * dft/indirect.c, rdft/indirect.c: Type qualifiers. * kernel/planner.c: ESTIMATE is no longer subsumed by everything else. * dft/indirect.c, rdft/indirect.c: NO_BUFFERING is a planner flag, not a problem flag * kernel/ifftw.h, kernel/planner.c: Maintain flags in canonical form. * kernel/planner.c, kernel/ifftw.h: In dramatic break with tradition, SUBSUME is now a partial order. I swear. * kernel/planner.c: Added comment * kernel/ifftw.h, kernel/planner.c, tests/bench.c: Inverted ESTIMATE flag, renamed USE_SCORE for consistency with the convention that 0 subsumes 1. 2002-09-17 Steven G. Johnson * dft/indirect.c, kernel/ifftw.h, rdft/indirect.c, tests/bench.c: NO_INDIRECT -> NO_INDIRECT_OP (out-of-place only) * acx_pthread.m4: hpux needs -D_REENTRANT (thanks to Clinton Roy for the bug report) 2002-09-17 Matteo Frigo * kernel/planner.c: Oops. * kernel/ifftw.h, kernel/planner.c: Yet another attempt at getting the planner right. * kernel/planner.c: Better coding. * kernel/ifftw.h, kernel/planner.c: NO_UGLY is no longer a flag, but a separate planner field that does not interfere with wisdom. 2002-09-16 Matteo Frigo * tests/verify-reodft.c: Did not compile without FFTW_DEBUG * kernel/ifftw.h, kernel/plan.c, kernel/planner.c, tests/bench.c: Changed scoring mechanism. * kernel/planner.c: Count infeasible plans * kernel/planner.c: curse subsumed plans before export 2002-09-16 Steven G. Johnson * kernel/ifftw.h, kernel/planner.c: removed ESTIMATE_BIT vs. ESTIMATE... ESTIMATE | IMPATIENT is a UI issue * rdft/buffered2.c: cleanup * dft/buffered.c, rdft/buffered.c, rdft/buffered2.c: use CONSERVE_MEMORY flag to prevent buffered for large sizes * kernel/ifftw.h: moved NO_DHT_R2HC back into planner flags: there's no reason we would want this flag to block plan reuse * kernel/ifftw.h: whoops, commas * kernel/ifftw.h: problem_flags == checked in applicable, planner_flags == checked in score * kernel/ifftw.h, kernel/planner.c: ESTIMATE should not *include* all impatience flags, even if it subsumes them; some impatience flags, like NO_INDIRECT, might make a problem unsolvable * kernel/planner.c: quotatio marks * kernel/planner.c: delete blank line * kernel/planner.c: substitution * kernel/planner.c: note that we are not GNUlly correct * kernel/planner.c: indenting * kernel/planner.c: more jokes * dft/ct-dit.c, dft/vrank-geq1.c, kernel/ifftw.h, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: NONTHREADED_ICKYP includes nthr > 1 check * kernel/md5.c: use md5sig * kernel/ifftw.h, kernel/planner.c: md5sig typedef * ChangeLog: updated * dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct.c, dft/indirect.c, dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/indirect.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, tests/bench.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: partially-ordered impatience 2002-09-14 Matteo Frigo * kernel/Makefile.am, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, tests/bench.c: Removed all that planner inheritance crap. 2002-09-14 Steven G. Johnson * kernel/planner.c: string.h is used for more than strlen 2002-09-14 Matteo Frigo * kernel/ifftw.h, kernel/planner.c: Reduced hashtable size by 1/6 (on 32-bit machines) at the expense of messier planner. * tests/bench.c: Only print wisdom if verbose > 3 * genfft-k7/variable.ml, genfft/variable.ml: Changed syntax of temporaries to avoid shadowing library functions (which is harmless but I hate the warning) 2002-09-14 Steven G. Johnson * acinclude.m4, configure.ac, dft/rader.c, kernel/alloc.c, kernel/assert.c, kernel/ifftw.h, kernel/md5.c, kernel/planner-score.c, kernel/primes.c, kernel/scan.c, libbench/bench-user.h, libbench/bench.h, libbench/report.c, libbench/timer.c, libbench/util.c, libbench/verify.c, rdft/rader-hc2hc.c, tests/bench.c, tests/verify-lib.c: only add warnings in debug/maintainer mode, and add a few more warning flags; eliminate more warnings; add support for posix_memalign (broken in glibc, grrr) 2002-09-14 Matteo Frigo * kernel/twiddle.c: Explicit cast * kernel/ifftw.h, kernel/planner.c, kernel/primes.c: Use double-hashing. This allows a slightly higher load factor at the expense of a messier computation of the hashtable size. 2002-09-13 Steven G. Johnson * genfft/magic.ml: typo 2002-09-13 Matteo Frigo * kernel/planner.c: Slight change in hash table growth functions. * kernel/ifftw.h, kernel/planner.c: More statistics. * kernel/planner.c: Clearer logic. * kernel/planner.c: Oops. * kernel/planner.c: Cleaned up * kernel/planner.c: Deal properly with infeasible problems. * kernel/planner.c: Redundantly initialize hash table to prevent valgrind warnings. 2002-09-12 Matteo Frigo * kernel/md5.c: Removed relics from past. * kernel/ifftw.h, kernel/planner.c: md5hash a problem only once. * genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml: Renamed k7 codelets 2002-09-12 Steven G. Johnson * kernel/ifftw.h, rdft/dht-r2hc.c: FORBID_DHT_R2HC -> DHT_R2HC_VERBOTEN for consistency * kernel/ifftw.h: removed obsolete macro 2002-09-12 Matteo Frigo * dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c: Split flags in SIMD code. * threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/rdft-vrank-geq1.c, threads/vrank-geq1-rdft2.c: Forgot to fix threads * dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct.c, dft/indirect.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/dht-r2hc.c, rdft/dht-rader.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/indirect.c, rdft/rank-geq2-rdft2.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft2-radix2.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, tests/bench.c: Split flags into planner_flags and problem_flags 2002-09-12 Steven G. Johnson * kernel/planner.c: tetrameter 2002-09-12 Matteo Frigo * kernel/planner.c: Overwrite less impatient solutions properly. * kernel/planner.c: Oops. * kernel/planner.c: Keep less impatient solution in case of conflict. Paranoid cast to uint in certain places. * kernel/ifftw.h, kernel/planner.c, tests/bench.c: Complete reimplementation of planner hash table. * kernel/planner.c: planner->cnt was not properly decremented. 2002-09-11 Steven G. Johnson * NEWS: typo 2002-09-09 Matteo Frigo * kernel/planner.c: Simplified * kernel/planner.c: Always overwrite old wisdom with new, in case the old is corrupt/conclicting. 2002-09-09 Steven G. Johnson * kernel/plan.c: added quote/joke 2002-09-09 Matteo Frigo * kernel/ifftw.h, kernel/md5.c, kernel/planner.c, kernel/print.c, kernel/scan.c, tests/bench.c: Completed wisdom import * dft/problem.c, kernel/ifftw.h, kernel/md5.c, rdft/problem.c, rdft/problem2.c: Slight cleanup of md5 interface. 2002-09-04 Matteo Frigo * kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c: More consistent protocol between planner and inferior. * kernel/planner.c: I can't think of any situation where saving infeasible problems would be desirable. Removed relevant code. * kernel/ifftw.h, kernel/planner.c, kernel/solvtab.c, tests/bench.c: Encoder registrar's names in wisdom. Remove export_conf, since a separate program can now generate it. 2002-09-03 Matteo Frigo * kernel/planner.c: Fixed typo * kernel/planner.c: Fixed broken trochaic meter. * kernel/planner.c: Initialize planner->score. It is correct to leave it uninitialized, but I don't want people to send reports about purify complaining. * kernel/planner.c: More latin silliness 2002-09-02 Steven G. Johnson * ChangeLog: updated * kernel/timer.c: added clock() getseconds timer 2002-09-02 Matteo Frigo * rdft/indirect.c: Oops * dft/indirect.c, kernel/ifftw.h, rdft/indirect.c: Experimental INDIRECT_VERBOTEN flag (not used) * dft/buffered.c, dft/indirect.c, kernel/ifftw.h, rdft/buffered.c, rdft/buffered2.c, rdft/indirect.c: Do not allow buffering in children of indirect solvers. * kernel/planner.c: Oops * kernel/planner.c: Hash sizeof(R) as part of wisdom. 2002-09-02 Steven G. Johnson * configure.ac: added --enable-float synonym for --enable-single (since with have --enable-long-double) 2002-09-02 Matteo Frigo * dft/Makefile.am, dft/problem.c, dft/zero.c: zerotens is now in its own file, so it does not cause dft to be linked in if only rdft is used. * kernel/planner.c: Removed unused var. * kernel/planner.c: Split insert() in preparation for wisdom import * dft/Makefile.am, dft/dft.h, dft/verify.c, kernel/Makefile.am, kernel/dotens.c, kernel/dotens2.c, kernel/ifftw.h, kernel/verify-lib.c, kernel/verify.h, rdft/Makefile.am, rdft/rdft.h, rdft/verify.c, reodft/Makefile.am, reodft/reodft.h, reodft/verify.c, tests/Makefile.am, tests/bench.c, tests/debug.h, tests/dotens.c, tests/dotens2.c, tests/verify-dft.c, tests/verify-lib.c, tests/verify-rdft.c, tests/verify-reodft.c, tests/verify.h: Moved debugging infrastructure to test directory so that it is not linked into the shared library. * kernel/print.c, kernel/planner.c: Reactivated wisdom export * kernel/verify-lib.c: Dump errors to stderr, not stdout. * kernel/Makefile.am, kernel/ifftw.h, kernel/planner-score.c, kernel/traverse.c, tests/bench.c: Removed traverse.c. traverse.c is no longer need for plan blessing. I figured out a way to avoid using it in planner-score.c, so the file is now redundant. 2002-09-01 Matteo Frigo * dft/conf.c, dft/dft.h, dft/problem.c, kernel/align.c, kernel/ifftw.h, kernel/planner.c, kernel/problem.c, kernel/scan.c, kernel/scanners.c, kernel/tensor.c, rdft/conf.c, rdft/problem.c, rdft/problem2.c, rdft/rdft.h: Removed code made obsolete by new MD5 scheme: problem equality tests, scanners, and associated list of problem kinds. * rdft/problem.c, rdft/problem2.c, dft/problem.c, kernel/Makefile.am, kernel/ifftw.h, kernel/md5.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/tensor.c: Started md5 implementation 2002-08-31 Matteo Frigo * kernel/ifftw.h, kernel/planner.c: Keep track of hit rate * kernel/planner.c: Only dump when verbose > 4 * dft/indirect.c, kernel/ifftw.h, kernel/plan.c, kernel/planner.c, tests/bench.c: Debugging infrastructure * kernel/planner.c, kernel/print.c: Use debug infrastructure to dump planner. * kernel/alloc.c, kernel/ifftw.h, kernel/plan.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/scan.c, kernel/scanners.c, tests/bench.c: Do not store plans in planner, plus general planner cleanup. 2002-08-30 Steven G. Johnson * kernel/ifftw.h, rdft/dht-r2hc.c: renamed IN_DHT_R2HC to the more general FORBID_DHT_R2HC * kernel/planner.c: eliminated unused var 2002-08-30 Matteo Frigo * kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c: Score planner was not working correctly when using wisdom. Fixed. * kernel/alloc.c, kernel/ifftw.h, kernel/stride.c: Use hash table in debug malloc 2002-08-30 Steven G. Johnson * NEWS: listed some good stuff * TODO: timed planner * TODO: fma? * TODO: update * rdft/Makefile.am, rdft/conf.c, rdft/dht-rader.c, rdft/rader-dht.c, rdft/rdft.h: rader-dht -> dht-rader * kernel/ifftw.h, rdft/Makefile.am, rdft/buffered2.c, rdft/conf.c, rdft/dht-r2hc.c, rdft/r2hc-hc2r.c, rdft/rader-dht.c, rdft/rank-geq2.c, rdft/rdft-dht.c, rdft/rdft.h: add DHT solver, and break up rader-dht and r2hc-hc2r * tests/bench.c: another option * dft/indirect.c, kernel/ifftw.h, kernel/tensor.c, rdft/indirect.c: generalized indirect solvers for fftw2-like buffering and more 2002-08-29 Steven G. Johnson * dft/vrank-geq1.c, kernel/ifftw.h, kernel/tensor.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: tensor_max_index and tensor_min_stride are now both unsigned * kernel/Makefile.am, kernel/iabs.c, kernel/ifftw.h, kernel/tensor.c, rdft/buffered2.c, rdft/problem2.c: added iabs.c, and tensor_min_stride returns min absolute value * rdft/buffered2.c: bug fix in cldrest hc2c/c2hc copy loops 2002-08-29 Matteo Frigo * TODO: Added things to do. 2002-08-29 Steven G. Johnson * configure.ac: added automake prereq 2002-08-29 Matteo Frigo * rdft/rdft2-radix2.c: Use indexed addressing * libbench/verify.c, rdft/rdft2-radix2.c: Ooops * kernel/ifftw.h: Oops 2002-08-29 Steven G. Johnson * threads/threads.c: updates to win32 threads code (ick) * Makefile.am, acx_pthread.m4, configure.ac, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/kdft-dif.c, dft/kdft-dit.c, dft/vrank-geq1.c, kernel/alloc.c, kernel/ifftw.h, kernel/planner.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/hc2hc.h, rdft/khc2hc-dif.c, rdft/khc2hc-dit.c, rdft/rdft.h, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, tests/Makefile.am, tests/bench.c, threads/Makefile.am, threads/conf.c, threads/ct-dit.c, threads/dft-vrank-geq1.c, threads/hc2hc-dif.c, threads/hc2hc-dit.c, threads/rdft-vrank-geq1.c, threads/threads.c, threads/threads.h, threads/vrank-geq1-rdft2.c: added threaded version 2002-08-28 Steven G. Johnson * kernel/Makefile.am: fix make dist * rdft/rank-geq2-rdft2.c: whoops, bugfix for inverse 2002-08-28 Matteo Frigo * Makefile.am, configure.ac, kernel/Makefile.am, kernel/dfftw3.h, kernel/fftw3.h, kernel/ifftw.h, kernel/lfftw3.h, kernel/sfftw3.h, tests/Makefile.am: Use C9x convention for naming (fftwf etc.). Removed installable header files since they will be part of the API. 2002-08-28 Steven G. Johnson * rdft/problem.c: allow _1 variants to accept rnk 0 (sz 1) problems 2002-08-27 Steven G. Johnson * ChangeLog: updated 2002-08-27 Matteo Frigo * dft/rank0.c: Loop unroll is useless * dft/ct-ditbuf.c: Use indexed addressing 2002-08-26 Matteo Frigo * dft/vrank2-transpose.c, dft/vrank3-transpose.c: Use indexed addressing in transpose routines. (Seems to be slightly better on athlon.) 2002-08-26 Steven G. Johnson * reodft/redft00e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c: added comment about stability 2002-08-26 Matteo Frigo * rdft/rdft2-radix2.c: Approximate opcount * dft/rank-geq2.c, rdft/rank-geq2.c, rdft/rdft2-radix2.c: Finished rdft2 via dft/rdft 2002-08-26 Steven G. Johnson * TODO: some updates * rdft/dft-r2hc.c, rdft/direct.c, rdft/generic.c, rdft/hc2hc.c, rdft/indirect.c, rdft/problem.c, rdft/r2hc-hc2r.c, rdft/rader-dht.c, rdft/rader-hc2hc.c, rdft/rank-geq2.c, rdft/rdft.h, rdft/rdft2-radix2.c, rdft/verify.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c, reodft/verify.c, tests/bench.c, rdft/Makefile.am, rdft/buffered.c, rdft/buffered2.c, rdft/conf.c: rdft kind is now per-dimension, added rdft/rank-geq2 * rdft/problem.c: added note * rdft/problem.c: must zero real sz * dft/rank-geq2.c, dft/vrank-geq1.c, kernel/Makefile.am, kernel/ifftw.h, kernel/pickdim.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, tests/bench.c: unified pickdim funcs 2002-08-25 FFTW Maintainers * libbench/mp.c, rdft/codelet.h, rdft/indirect.c, rdft/rank-geq2-rdft2.c, rdft/verify.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/verify.c: silence warnings 2002-08-25 Matteo Frigo * dft/codelet.h, dft/codelets/n.c, dft/codelets/t.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/direct.c, dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, dft/vrank-geq1.c, kernel/ifftw.h, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c: I had to add another planner flag to record whether pointers could become unaligned because of vrank-geq1 solvers (these solvers only plan the first element of a vector problem, but the second element may have a different alignment). This addition is ugly, but I don't see any way around it. * TODO: Added thoughts * rdft/Makefile.am, rdft/conf.c, rdft/rdft.h, rdft/rdft2-dft.c, rdft/rdft2-radix2.c: Implemented rdft2 via vector rdft + radix2 step 2002-08-24 Matteo Frigo * rdft/rdft2-dft.c: Stylistic changes * dft/ct.c, dft/generic.c, kernel/ifftw.h, kernel/twiddle.c, rdft/generic.c, rdft/rdft2-dft.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c: Simplified mktwiddle interface * dft/ct-dif.c, dft/ct-dit.c, dft/ct.c, dft/ct.h, kernel/ifftw.h, kernel/tensor.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/hc2hc.h, rdft/rdft2-dft.c: Unification of certain vector computations. rdft2-dft is now a vector transform. * configure.ac, simd/sse.c, simd/sse2.c: Intel compiler seems to be still buggy 2002-08-23 Matteo Frigo * dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/generic.c, dft/indirect.c, kernel/ifftw.h, kernel/twiddle.c, rdft/generic.c, rdft/hc2hc.c, rdft/rdft2-dft.c, reodft/redft00e-r2hc.c, reodft/reodft010e-r2hc.c, reodft/reodft11e-r2hc.c, reodft/rodft00e-r2hc.c: Streamlined twiddle protocol * libbench/verify.c, rdft/Makefile.am, rdft/conf.c, rdft/rdft.h, rdft/rdft2-dft.c: Implemented rdft2 via dft (forward only for now) 2002-08-22 Matteo Frigo * kernel/verify-lib.c, libbench/verify.c: More cleanup of verify * kernel/verify-lib.c: Changed error criterion because old one was too strict * bootstrap.sh: Disable shared * TODO: Added thoughts * dft/generic.c: Oops * dft/generic.c, kernel/alloc.c, kernel/planner-score.c, kernel/tensor.c: Do not use inline. Minor changes. 2002-08-21 Steven G. Johnson * tests/bench.c: more commented flags 2002-08-20 Steven G. Johnson * reodft/Makefile.am, reodft/conf.c, reodft/reodft11e-r2hc.c, reodft/verify.c, tests/bench.c: added DCT-IV and DST-IV 2002-08-20 Matteo Frigo * genfft/twiddle.ml: Slight improvement in twiddle scheme 2002-08-20 Steven G. Johnson * reodft/conf.c, reodft/reodft.h, reodft/reodft010e-r2hc.c: name fix * reodft/reodft010e-r2hc.c: removed extraneous variable 2002-08-20 Matteo Frigo * libbench/mp.c, libbench/verify.c: Oops * genfft/twiddle.ml, kernel/trig.c: Still playing around 2002-08-19 Matteo Frigo * TODO, genfft/algsimp.ml, genfft/expr.ml, genfft/expr.mli, genfft/twiddle.ml, support/addchain.c: Playing around with addition chain 2002-08-19 Steven G. Johnson * reodft/redft00e-r2hc.c, reodft/rodft00e-r2hc.c: comments * reodft/reodft010e-r2hc.c: comment fixes * Makefile.am, configure.ac, dft/dft.h, rdft/rdft.h, reodft/Makefile.am, reodft/conf.c, reodft/redft00e-r2hc.c, reodft/reodft.h, reodft/reodft010e-r2hc.c, reodft/rodft00e-r2hc.c, reodft/verify.c, tests/Makefile.am, tests/bench.c: added reodft stuff 2002-08-18 Matteo Frigo * libbench/Makefile.am, libbench/verify.c: Sync with nbenchfft * genfft/complex.ml, genfft/complex.mli, genfft/twiddle.ml: Economy of thought 2002-08-17 Steven G. Johnson * support/Makefile.am: distribute addchain.c 2002-08-17 Matteo Frigo * support/addchain.c: Nothing serious * genfft/twiddle.ml, support/addchain.c: New twiddle policy (disabled for now) 2002-08-17 Steven G. Johnson * rdft/rank-geq2-rdft2.c: bug fix for hc2r (must use inverse dft) 2002-08-17 Matteo Frigo * dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, genfft/twiddle.ml, rdft/codelets/hc2r/Makefile.am, rdft/codelets/r2hc/Makefile.am: New log3 twiddle policy 2002-08-16 Matteo Frigo * dft/verify.c, kernel/verify-lib.c, kernel/verify.h, rdft/verify.c: More verify cleanup * rdft/verify.c: Oops * dft/verify.c, kernel/Makefile.am, kernel/verify-lib.c, kernel/verify.h, rdft/verify.c: Economy of thought (and code) * TODO: Added comment * libbench/mp.c: Cleaner rounding algorithm * libbench/mp.c: Can get away with shorter length in bluestein (I think). * libbench/mp.c: Portability improvements * libbench/bench-main.c, libbench/bench.h, libbench/verify.c: Optionally average accuracy test over many rounds * dft/rader.c, rdft/rader-dht.c, rdft/rader-hc2hc.c: More accurate formula for trig tables * libbench/mp.c, libbench/verify.c: Implemented accuracy test for all integers 2002-08-15 Matteo Frigo * libbench/mp.c: inv, neg: make static * libbench/verify.c: Verify was not complete for real transforms * libbench/verify.c: Oops * genfft/gen_hc2hc.ml, libbench/verify.c: Fixed hb codelets * dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, rdft/codelets/r2hc/Makefile.am: Changed twiddle policy 2002-08-15 Steven G. Johnson * rdft/direct2.c: whoops 2002-08-15 Matteo Frigo * libbench/Makefile.am, tests/Makefile.am: No point in libbench being a shared library * libbench/Makefile.am, libbench/bench-main.c, libbench/bench.h, libbench/mp.c, libbench/util.c, libbench/verify.c, tests/Makefile.am, tests/accuracy.c, tests/mp.c: Moved accuracy test to libbench 2002-08-14 Matteo Frigo * tests/accuracy.c: Modified accuracy test * tests/accuracy.c, tests/mp.c: Fixes for long double * tests/accuracy.c: Normalize input * tests/accuracy.c: Oops * tests/accuracy.c: Also compute relative error * tests/accuracy.c: Loop over N * tests/Makefile.am, tests/accuracy.c, tests/mp.c: simple-minded accuracy test 2002-08-14 Steven G. Johnson * rdft/rank-geq2-rdft2.c: whoops 2002-08-13 Matteo Frigo * kernel/trig.c: fma() stuff is too nonportable, removed 2002-08-12 Steven G. Johnson * rdft/problem.c: slight fix * rdft/problem.c: use table for rdft_kind_str * rdft/problem2.c: slight fixes * kernel/ifftw.h, kernel/planner.c, kernel/tensor.c, rdft/Makefile.am, rdft/buffered2.c, rdft/conf.c, rdft/direct2.c, rdft/nop2.c, rdft/problem2.c, rdft/rdft.h, rdft/vrank-geq1-rdft2.c, tests/bench.c: multidimensional rdft2 2002-08-10 Steven G. Johnson * rdft/indirect.c: use tensor_copy_inplace * dft/rank-geq2.c: bugfix, use tensor_copy_inplace * dft/indirect.c: use tensor_copy_inplace * kernel/ifftw.h, kernel/tensor.c: added tensor_copy_inplace * kernel/twiddle.c: fixed trig-function table type 2002-08-10 Matteo Frigo * kernel/trig.c, tests/trigtest.c: Improved trig scheme * tests/trigtest.c: Allow for testing using long double instead of pari * kernel/trig.c, tests/trigtest.c: Yet another trig scheme. * kernel/trig.c, tests/trigtest.c: Yet another scheme * kernel/ifftw.h, kernel/trig.c, tests/trigtest.c: Careful with overflow * kernel/ifftw.h, kernel/trig.c, tests/trigtest.c: Avoid overflow 2002-08-09 Matteo Frigo * dft/rader.c, dft/verify.c, kernel/ifftw.h, kernel/trig.c, kernel/twiddle.c, rdft/rader-dht.c, rdft/rader-hc2hc.c, rdft/verify.c, tests/trigtest.c: New(er) trig routines * tests/bench.c: Oops * tests/trigtest.c: New file * TODO: Commented about likely gcc bug * dft/rader.c, dft/verify.c, kernel/Makefile.am, kernel/ifftw.h, kernel/trig.c, kernel/twiddle.c, rdft/rader-dht.c, rdft/rader-hc2hc.c, rdft/verify.c, tests/bench.c: Improved accuracy of twiddle factors 2002-08-08 Matteo Frigo * simd/simd-3dnow.h: Wrong comment 2002-08-07 Matteo Frigo * configure.ac, genfft/gen_notw_c.ml, genfft/gen_twiddle_c.ml, kernel/ifftw.h, simd/3dnow.c, simd/Makefile.am, simd/simd-3dnow.h, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h, simd/simd.h: Experimental 3dnow port using gcc, to compare it with Stefan's stuff. * genfft/c.ml, kernel/ifftw.h: End of AREF experiment * configure.ac: Oops * configure.ac: Pathetic attempt to reduce size of configure script * genfft/c.ml, kernel/ifftw.h: Changed array syntax for experiments. 2002-08-06 Matteo Frigo * simd/simd-sse2.h: Fix warning * dft/problem.c, kernel/align.c, kernel/ifftw.h, rdft/problem.c, rdft/problem2.c: Move nonportable stuff in one place. * kernel/planner.c: Economy of thought: I didn't like having two algorithms for removing solutions, both correct. At least now we have the same algorithm copied twice. * TODO: Added things to do 2002-08-05 Steven G. Johnson * kernel/ifftw.h, kernel/planner.c: improved interaction of planner with patience flags * rdft/buffered.c, rdft/codelet.h, rdft/indirect.c, rdft/problem.c, rdft/rader-hc2hc.c, rdft/rdft.h, tests/bench.c: set up for real-even/odd DFTs, where n is not the size of the data * dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, kernel/ifftw.h, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/hc2hc.c, rdft/hc2hc.h, rdft/r2hc-hc2r.c, tests/bench.c: DESTROY_INPUT flag * dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner.c, rdft/dft-r2hc.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, tests/bench.c: CLASSIC -> IMPATIENT 2002-08-04 Matteo Frigo * genfft-k7/Makefile.am, genfft/Makefile.am: Require make maintainer-clean to remove the generator, as opposed to make clean. In this way we can type make clean without regenerating all codelets. 2002-08-04 Steven G. Johnson * kernel/planner.c: ESTIMATE plans are not blessed * kernel/ifftw.h, kernel/planner.c: use flags in wisdom * dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/direct.c, dft/generic.c, dft/indirect.c, dft/nop.c, dft/rader.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/planner-score.c, kernel/tensor.c, rdft/buffered.c, rdft/buffered2.c, rdft/dft-r2hc.c, rdft/direct.c, rdft/direct2.c, rdft/generic.c, rdft/hc2hc-buf.c, rdft/hc2hc-dif.c, rdft/hc2hc-dit.c, rdft/indirect.c, rdft/nop.c, rdft/nop2.c, rdft/r2hc-hc2r.c, rdft/rader-dht.c, rdft/rader-hc2hc.c, rdft/rank0.c, rdft/vrank-geq1-rdft2.c, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c, tests/bench.c: score now takes plnr, not flags, as arg * acinclude.m4: align initial stack in alignment check, which should now pass for gcc 3.1.1 2002-08-04 Matteo Frigo * acinclude.m4: Detect ultrasparc (sort of) 2002-08-03 Steven G. Johnson * rdft/codelet.h: added solvtab_rdft_r2r placeholder 2002-08-03 Matteo Frigo * support/Makefile.codelets: Damn solaris 2002-08-03 Steven G. Johnson * rdft/problem.c: use E extended precision in solvers * rdft/codelet.h: an alternative notation for D{C,S}T: DXTio, where i/o are {0,1} according to whether the input/output are shifted, respectively. Alternatively, io is the binary representation of the usual DXT-{I,II,III,IV} nomenclature, minus 1. * dft/generic.c, dft/rader.c, rdft/generic.c, rdft/r2hc-hc2r.c, rdft/rader-dht.c: use E extended precision in solvers 2002-08-03 Matteo Frigo * configure.ac, kernel/cycle.h, kernel/planner.c, rdft/problem2.c: More portability fixes, compiler bugs workarounds, etc. * configure.ac, kernel/cycle.h, kernel/ifftw.h: More portability work * acinclude.m4, configure.ac, kernel/cycle.h, kernel/ifftw.h, support/Makefile.codelets: Improved portability, removed gnu make dependencies * TODO: Remember to thank XXX 2002-08-02 Matteo Frigo * simd/simd-altivec.h: Multiplication on altivec requires FMA with -0.0 to be IEEE754 compliant. * genfft/c.ml, kernel/ifftw.h: Allow for extended precision in codelets * dft/codelets/inplace/Makefile.am: Shortened names 2002-08-02 Steven G. Johnson * TODO, rdft/codelet.h, rdft/problem.c: added infrastructure for future r2r transforms 2002-08-02 Matteo Frigo * Makefile.am, configure.ac: Version info * dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/k7/codelets/Makefile.am, dft/simd/codelets/Makefile.am, kernel/align.c, rdft/codelets/hc2r/Makefile.am, rdft/codelets/r2hc/Makefile.am: Listened to one customer and added radix-12. Added radix-15 for consistency (whatever that is) 2002-08-01 Steven G. Johnson * kernel/cycle.h: whoops again, fixed the wrong line * kernel/cycle.h: whoops * configure.ac, kernel/planner.c: use new AC_INIT and add VERSION to wisdom * kernel/scan.c: mygetR -> getR * dft/problem.c, kernel/planner.c, kernel/scan.c, kernel/tensor.c, rdft/problem.c, rdft/problem2.c, tests/bench.c: scanner cleanups: just return 0/1, simplify integer reads 2002-08-01 Matteo Frigo * kernel/align.c: Reverted back to casting pointer to ulong * kernel/ifftw.h: Cast to unsigned long, not long 2002-08-01 Steven G. Johnson * kernel/scan.c: additional comment * kernel/scan.c: added comment * dft/conf.c, dft/dft.h, dft/problem.c, dft/verify.c, kernel/Makefile.am, kernel/alloc.c, kernel/assert.c, kernel/debug.c, kernel/ifftw.h, kernel/planner.c, kernel/print.c, kernel/printers.c, kernel/problem.c, kernel/scan.c, kernel/scanners.c, kernel/tensor.c, kernel/timer.c, rdft/conf.c, rdft/problem.c, rdft/problem2.c, rdft/rdft.h, rdft/verify.c, tests/bench.c: added wisdom import * kernel/align.c: whoops * dft/problem.c, rdft/problem.c, rdft/problem2.c: use %u for alignment_of * kernel/align.c: ptrdiff_t form 2002-08-01 Matteo Frigo * kernel/ifftw.h: Cast to avoid warning from C++ compiler 2002-07-31 Matteo Frigo * dft/problem.c, kernel/Makefile.am, kernel/align.c, kernel/ifftw.h, rdft/problem.c, rdft/problem2.c, simd/simd.h: Make problem equality depend on alignments. * dft/simd/codelets/Makefile.am: Shorter names * simd/simd-sse.h: Oops * simd/simd-sse.h: Fix warning * kernel/alloc.c, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c: Removed silly abstraction barrier. Also, cons() terminology was no longer appropriate. 2002-07-31 Steven G. Johnson * kernel/ifftw.h, kernel/planner.c, kernel/solvtab.c: removed register_registrar and solvtab_exec_reverse hacks 2002-07-30 Steven G. Johnson * kernel/planner.c: register_registrar doesn't search whole solver list (maybe we should change register_solver instead) * kernel/cycle.h: credit * kernel/cycle.h: added HP/UX ia64 support, courtesy of Teresa L. Johnson 2002-07-30 Matteo Frigo * dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, kernel/alloc.c: Fixed alignment checks 2002-07-30 Steven G. Johnson * kernel/ifftw.h, kernel/planner.c, kernel/solvtab.c: ugh, wisdom id fixes in exprt_conf * kernel/ifftw.h, kernel/planner.c, tests/bench.c: exprt_registrars -> exprt_conf, added missing SOLVTAB_END * kernel/planner.c: exprt_registrars should output self-contained configuration * dft/conf.c, kernel/ifftw.h, kernel/planner.c, kernel/solvtab.c, rdft/conf.c, support/Makefile.codelets, tests/bench.c: added exprt_registrars * kernel/print.c: whoops 2002-07-30 Matteo Frigo * dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/t1b.c, dft/simd/t1b.h, dft/simd/t1f.c, dft/simd/t1f.h, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h, simd/simd.h: More stringent requirements on strides for SIMD codelets 2002-07-30 Steven G. Johnson * rdft/buffered2.c: remove warning * rdft/problem.c, rdft/problem2.c, dft/problem.c, kernel/print.c, kernel/traverse.c: use %td for ptrdiff_t and %T for tensors 2002-07-29 Matteo Frigo * dft/buffered.c: Fix for SIMD * kernel/ifftw.h: Missing lfftw_mkstride and lfftw_stride_destroy * simd/simd-altivec.h: Implement LDA/STA * dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h, simd/simd.h: More SIMD work * simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h: Cleanup 2002-07-29 Steven G. Johnson * ChangeLog: update 2002-07-29 Matteo Frigo * dft/simd/n1b.c, dft/simd/n1f.c, dft/simd/t1b.c, dft/simd/t1f.c, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h: Also check strides in SIMD codelets * simd/simd-altivec.h: Minor changes, mostly for consistency with the big-endian processor 2002-07-29 Steven G. Johnson * rdft/rader-dht.c: added comment * configure.ac, kernel/alloc.c: added code for icc's _mm_malloc (memalign replacement) 2002-07-28 Steven G. Johnson * rdft/problem2.c, rdft/verify.c, tests/bench.c: slight fixes * rdft/problem2.c: whoops 2002-07-28 Matteo Frigo * simd/altivec.c, simd/simd-altivec.h: Use vec_xor to change sign 2002-07-28 Steven G. Johnson * rdft/Makefile.am, rdft/buffered2.c, rdft/conf.c, rdft/direct2.c, rdft/khc2r.c, rdft/kr2hc.c, rdft/nop2.c, rdft/plan2.c, rdft/problem2.c, rdft/rdft.h, rdft/solve2.c, rdft/verify.c, rdft/vrank-geq1-rdft2.c, tests/bench.c: added rdft2 2002-07-28 Matteo Frigo * simd/simd-altivec.h: Optimized * simd/simd-altivec.h: Changed ALIGNMENT * simd/simd-sse.h: alignment := 8 * simd/simd-altivec.h: Avoid warning * simd/simd-sse2.h: Oops * genfft/annotate.ml, genfft/genutil.ml, genfft/simd.ml, genfft/variable.ml, genfft/variable.mli, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h: New altivec experiment * simd/simd-altivec.h: Nothing * simd/simd-altivec.h: Oops * simd/simd-altivec.h: Nothing * simd/Makefile.am, simd/altivec.c, simd/simd-altivec.h: Constants are now in separate file. * simd/simd-altivec.h: More precise comment * simd/simd-altivec.h: gcc-3.1 bug workaround 2002-07-28 Steven G. Johnson * dft/buffered.c, dft/dft.h, dft/problem.c, rdft/buffered.c, rdft/problem.c, rdft/rdft.h: slight optimization, and exported zerotens functions * rdft/dft-r2hc.c: should be a plan_dft, not a plan_rdft 2002-07-28 Matteo Frigo * simd/simd-altivec.h: Optimizations. Make it work with vanilla non-Apple gcc. 2002-07-27 Steven G. Johnson * rdft/generic.c: whoops * rdft/generic.c: added hc2r (dif) * rdft/rader-hc2hc.c: add hc2r (dif) case 2002-07-27 Matteo Frigo * simd/simd-altivec.h, support/Makefile.codelets: Altivec port * kernel/twiddle.c: Fixed signed/unsigned bug. 2002-07-26 Matteo Frigo * dft/rank0.c, rdft/rank0.c: Make rank0 unapplicable to in-place problems. 2002-07-25 Steven G. Johnson * rdft/generic.c: only works for r odd 2002-07-25 Matteo Frigo * kernel/planner-score.c: Reinserted much better timing-avoidance heuristic * dft/buffered.c, kernel/ifftw.h, kernel/plan.c, kernel/planner-score.c, kernel/traverse.c, rdft/buffered.c, tests/bench.c: Score is now a property of the plan, not of the solver. Revised representation of closures. * genfft/gen_hc2r.ml, genfft/gen_r2hc.ml, rdft/codelets/hc2r/Makefile.am: Cosmetic changes. Added hc2r_128.c 2002-07-25 Steven G. Johnson * rdft/rader-dht.c: added hc2r * rdft/Makefile.am, rdft/hc2hc-buf.c, rdft/hc2hc-ditbuf.c, rdft/khc2hc-dif.c: added hc2hc-difbuf * rdft/Makefile.am, rdft/hc2hc-dif.c, rdft/hc2hc.c, rdft/khc2hc-dif.c, rdft/rdft.h: added rdft-dif * rdft/verify.c: whoops, hc2r must be conjugated to have right sign * dft/ct-dif.c: slight change * rdft/verify.c: whoops * rdft/Makefile.am, rdft/codelet.h, rdft/direct-r2hc.c, rdft/direct.c, rdft/khc2r.c, rdft/rdft.h: support hc2r codelets * rdft/dft-r2hc.c: use vector plan for r/i instead of two separate plans * dft/buffered.c, dft/rader.c, kernel/ifftw.h, rdft/buffered.c, rdft/rader-dht.c, rdft/rader-hc2hc.c: hack to allow rader/generic to work in-place for small prime sizes, instead of always using buffered 2002-07-24 Steven G. Johnson * rdft/Makefile.am, rdft/conf.c, rdft/generic.c: added rdft-generic * dft/generic.c: fixed add count * rdft/rader-hc2hc.c: again * rdft/rader-hc2hc.c: slight fix * rdft/rader-hc2hc.c: fixed comment * tests/bench.c: whoops * rdft/Makefile.am, rdft/conf.c, rdft/rader-hc2hc.c, rdft/rdft.h, tests/bench.c: added rader-hc2hc * dft/rader.c: whoops, initialize W * rdft/rader-dht.c: strides should not be unsigned * dft/rader.c: more stride sign fixes * dft/rader.c: strides should not be unsigned! 2002-07-23 Steven G. Johnson * rdft/dft-r2hc.c: added comment * rdft/r2hc-hc2r.c: another fix to op count * rdft/r2hc-hc2r.c: whoops * rdft/dft-r2hc.c, rdft/r2hc-hc2r.c: slight fix to op counts * rdft/conf.c, rdft/dft-r2hc.c, rdft/rdft.h, rdft/Makefile.am: added dft-r2hc * rdft/rader-dht.c: better comment and var. name * rdft/Makefile.am, rdft/conf.c, rdft/r2hc-hc2r.c, rdft/rdft.h, rdft/verify.c, tests/bench.c: fixed tests for hc2r, and added r2hc-hc2r * rdft/Makefile.am, rdft/conf.c, rdft/rader-dht.c, rdft/rdft.h: added rader-dht 2002-07-23 Matteo Frigo * rdft/codelets/r2hc/Makefile.am: Added r2hc_128, what the hell. * rdft/codelets/r2hc/Makefile.am: Added codelets that compute twiddle factors 2002-07-22 Steven G. Johnson * rdft/Makefile.am, rdft/buffered.c, rdft/conf.c: added rdft-buffered * rdft/Makefile.am, rdft/hc2hc-ditbuf.c, rdft/khc2hc-dit.c: added hc2hc-ditbuf * dft/generic.c: use STACK_MALLOC (alloca), since generic radix is always small * rdft/hc2hc-dit.c: small cleanup 2002-07-22 Matteo Frigo * rdft/problem.c: What the hell was I thinking? * rdft/problem.c: Reduced code size by using table instead of switch statement. * rdft/problem.c: Changed hash function to avoid collisions with DFT. 2002-07-22 Steven G. Johnson * rdft/hc2hc-dit.c: added missing file, whoops * rdft/hc2hc.c: whoops, generate enough twiddles for odd m * rdft/verify.c: don't try to verify R2HCII or HC2RIII plans * rdft/hc2hc.c: recursive case now works, I think * rdft/verify.c: add extra impulse test for debugging * rdft/direct-r2hc.c: whoops, multiply ios offset by stride (and rename to ioffset) * rdft/verify.c: whoops * genfft/gen_hc2hc.ml, rdft/Makefile.am, rdft/hc2hc.c, rdft/khc2hc-dit.c: added hc2hc-dit * kernel/twiddle.c: twiddles can be shared with smaller m's * rdft/Makefile.am, rdft/codelet.h, rdft/codelets/hfb.c, rdft/hc2hc.c, rdft/hc2hc.h: preparing for recursive rdft... 2002-07-21 Steven G. Johnson * rdft/verify.c: slight fix, to match libbench/verify.c * rdft/direct-r2hc.c: r2hcII has imag parts offset by n-1, not n. We can also allocate fewer strides. * rdft/rank0.c: delete unused var * rdft/Makefile.am, rdft/codelet.h, rdft/codelets/hc2r.c, rdft/codelets/r2hc.c, rdft/conf.c, rdft/direct-r2hc.c, rdft/indirect.c, rdft/khc2rIII.c, rdft/kr2hc.c, rdft/kr2hcII.c, rdft/nop.c, rdft/problem.c, rdft/rank0.c, rdft/rdft.h, rdft/vrank-geq1.c, rdft/vrank2-transpose.c, rdft/vrank3-transpose.c: added some rdft solvers * kernel/fftw3.h: pass identifier in FFTW() through another macro so that the mangled name can itself be a preprocessor symbol * dft/vrank-geq1.c: fix in comment * Makefile.am, rdft/rdft.h, tests/bench.c: bench tests rdft plans * rdft/codelet.h, tests/Makefile.am, tests/bench.c: make rdft.h and dft.h compatible * rdft/Makefile.am, rdft/problem.c, rdft/rdft.h, rdft/verify.c: first-draft rdft verify * rdft/khc2hc-dif.c, rdft/khc2hc-dit.c, rdft/khc2r.c, rdft/khc2rIII.c, rdft/kr2hc.c, rdft/kr2hcII.c: got rid of annoying warnings * rdft/Makefile.am, rdft/khc2hc-dif.c, rdft/khc2hc-dit.c, rdft/khc2r.c, rdft/khc2rIII.c, rdft/kr2hc.c, rdft/kr2hcII.c, rdft/rdft.h: added stub codelet registration for linking purposes * rdft/Makefile.am, rdft/conf.c, rdft/plan.c, rdft/problem.c, rdft/rdft.h, rdft/solve.c: basic rdft stuff * Makefile.am, configure.ac, dft/codelet.h, genfft/gen_hc2hc.ml, kernel/ifftw.h, rdft/Makefile.am, rdft/codelet.h, rdft/codelets/Makefile.am, rdft/codelets/hb.h, rdft/codelets/hc2r.c, rdft/codelets/hc2r.h, rdft/codelets/hc2r/Makefile.am, rdft/codelets/hc2rIII.h, rdft/codelets/hf.h, rdft/codelets/hfb.c, rdft/codelets/r2hc.c, rdft/codelets/r2hc.h, rdft/codelets/r2hc/Makefile.am, rdft/codelets/r2hcII.h: rdft codelets now compile 2002-07-20 Matteo Frigo * genfft/gen_hc2r.ml: Oops, was generating rdfts instead of hdfts * TODO, configure.ac, genfft-k7/twiddle.ml, genfft/twiddle.ml, kernel/twiddle.c, rdft/codelets/hc2r/Makefile.am: Added hc2r codelets * genfft/gen_hc2hc.ml: return W in hc2hc codelets * configure.ac, dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/simd/codelets/Makefile.am, genfft/gen_hc2hc.ml, genfft/gen_hc2r.ml, genfft/gen_r2hc.ml, genfft/trig.ml, rdft/codelets/r2hc/Makefile.am, support/Makefile.codelets: Some work on rdft codelets 2002-07-16 Matteo Frigo * kernel/fftw3.h: fix const * acinclude.m4, configure.ac, dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/indirect.c, dft/rank0.c, dft/simd/codelets/Makefile.am, kernel/version.c, tests/bench.c: Separate CFLAGS in codelets. Fix const in certain places. 2002-07-16 Steven G. Johnson * TODO: note buffering problem 2002-07-16 Matteo Frigo * dft/generic.c: Removed unpredictable branch from inner loop 2002-07-15 Steven G. Johnson * TODO: update * dft/generic.c: optimization * dft/Makefile.am, dft/conf.c, dft/dft.h, dft/generic.c, kernel/ifftw.h, kernel/twiddle.c: added generic dit * dft/rader.c: whoops, mksolver should be static 2002-07-15 Matteo Frigo * genfft/c.mli, genfft/gen_hc2hc.ml, genfft/gen_hc2r.ml, genfft/gen_r2hc.ml, genfft/genutil.ml, genfft/Makefile.am, genfft/algsimp.ml, genfft/c.ml: First implementation of gen_hc2hc, probably still buggy. 2002-07-15 Steven G. Johnson * dft/rader.c: don't count loading of twiddle factors in ops.other, since it isn't counted for the codelets * dft/ct.c, dft/rader.c, kernel/plan.c: plan_destroy puts plan to sleep before deallocating it, to eliminate duplicate free calls in solvers * dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct.c, dft/vrank-geq1.c, kernel/ifftw.h, tests/bench.c: fftw2-like vector recursion flag 2002-07-15 Matteo Frigo * kernel/planner.c: More jokes * tests/bench.c: Bless plan for testing purposes * kernel/planner.c: Canonical linked-list deletion (hope it is right) 2002-07-14 Steven G. Johnson * dft/rader.c: use estimating planner for cld_omega * dft/rader.c: better internal naming * dft/rader.c: printing should really be fixed now, grrr * dft/rader.c: print all distinct child plans * tests/bench.c: whoops * dft/rader.c: whoops, destroy should delete twiddle/omega from list * kernel/planner.c: whoops * kernel/ifftw.h, kernel/plan.c, kernel/planner.c: added plan_bless and FORGET_ACCURSED * kernel/traverse.c: further cleanup * kernel/traverse.c: slight cleanup * kernel/Makefile.am, kernel/ifftw.h, kernel/traverse.c, tests/bench.c: added traverse_plan via print (ugh) * dft/ct.c, kernel/ifftw.h, kernel/twiddle.c: added TW_FULL, and additional n parameter for twiddles * kernel/planner.c: whoops * kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c: save flags before invoking solver mkplan 2002-07-14 Matteo Frigo * TODO: *** empty log message *** 2002-07-14 Steven G. Johnson * configure.ac, kernel/cycle.h: added support for UNICOS _rtc() real-time-clock intrinsic function * kernel/timer.c: fixed typo: HAVE_TIME_H should include time.h, not sys/time.h * configure.ac, kernel/cycle.h: support AIX read_real_time timer 2002-07-13 Steven G. Johnson * configure.ac: use && instead of the (sigh) unportable -a * configure.ac: use AC_HELP_STRING * configure.ac, dft/codelet.h, dft/verify.c, kernel/Makefile.am, kernel/dfftw3.h, kernel/fftw3.h, kernel/ifftw.h, kernel/lfftw3.h, kernel/sfftw3.h, libbench/bench-main.c, libbench/bench-user.h, libbench/info.c, libbench/verify.c, simd/simd-sse2.h: support long-double precision * dft/rader.c: whoops whoops * dft/rader.c: whoops * TODO: buffered solver strides have been fixed * dft/rader.c: convention * TODO, dft/rader.c: share twiddle arrays in Rader * libbench/verify.c: call done() after verify 2002-07-12 Steven G. Johnson * tests/bench.c: output planner time with -v * kernel/print.c: support double outputs * dft/vrank-geq1.c: removed extraneous parens * dft/buffered.c: increase maxbufsz to 64k; makes a big difference for large 2d transforms 2002-07-12 Matteo Frigo * dft/vrank-geq1.c: Fix 2002-07-12 Steven G. Johnson * dft/rank-geq2.c: fix comment * kernel/tensor.c: fix in comment * ChangeLog: updated * TODO: buffered malloc's buffers * TODO, dft/rader.c: share more code between apply and apply_dit in Rader 2002-07-08 Matteo Frigo * simd/simd-sse.h, simd/simd-sse2.h, simd/sse.c, simd/sse2.c: Polished * support/Makefile.codelets: *** empty log message *** * dft/simd/codelets/Makefile.am, genfft/c.ml, genfft/gen_notw_c.ml, genfft/simd.ml, genfft/to_alist.ml, genfft/to_alist.mli, simd/simd-sse.h, simd/simd-sse2.h, support/Makefile.codelets: SIMD/FMA stuff * simd/simd-sse.h: Avoid code duplication * genfft/Makefile.am, genfft/to_alist.ml: Fixes for FMA+SIMD * dft/buffered.c, dft/codelets/standard/Makefile.am, dft/simd/Makefile.am, dft/simd/NAMING, dft/simd/codelets/Makefile.am, dft/simd/n1b.c, dft/simd/n1b.h, dft/simd/n1f.c, dft/simd/n1f.h, dft/simd/n2f.c, dft/simd/n2f.h, dft/simd/n3f.h, dft/simd/n4.c, dft/simd/n4.h, dft/simd/t1b.c, dft/simd/t1b.h, dft/simd/t1f.c, dft/simd/t1f.h, dft/simd/t2f.c, dft/simd/t2f.h, dft/simd/t3f.h, dft/simd/t4.c, dft/simd/t4.h, genfft/Makefile.am, genfft/algsimp.ml, genfft/annotate.ml, genfft/annotate.mli, genfft/c.ml, genfft/complex.ml, genfft/complex.mli, genfft/expr.ml, genfft/expr.mli, genfft/gen_athnotw.ml, genfft/gen_athtw.ml, genfft/gen_conv.ml, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_notw_c.ml, genfft/gen_r2hc.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twiddle_c.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/oracle.ml, genfft/simd.ml, genfft/simd.mli, genfft/simdmagic.ml, genfft/to_alist.ml, genfft/trig.ml, genfft/trig.mli, genfft/twiddle.ml, kernel/ifftw.h, simd/simd-sse.h, simd/simd-sse2.h, simd/sse.c, simd/sse2.c, support/Makefile.codelets: Major changes in SIMD fftw 2002-07-05 Matteo Frigo * dft/buffered.c, simd/simd-altivec.h, simd/simd-sse.h: Use unpck instructions instead of shuffles * dft/codelets/n.c, dft/codelets/t.c, dft/ct-ditbuf.c, dft/verify.c, kernel/ifftw.h, kernel/planner.c, tests/bench.c: Minor tweaks * tests/bench.c: Use score planner * CONVENTIONS, dft/Makefile.am, dft/dft.h, dft/verify.c, kernel/Makefile.am, kernel/debug.c, kernel/dotens.c, kernel/dotens2.c, kernel/ifftw.h, tests/bench.c: Added verifier 2002-07-04 Matteo Frigo * dft/buffered.c, dft/codelet.h, dft/codelets/n.c, dft/codelets/t.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct.c, dft/ct.h, dft/simd/Makefile.am, dft/simd/NAMING, dft/simd/codelets/Makefile.am, dft/simd/n2f.c, dft/simd/n2f.h, dft/simd/n3f.h, dft/simd/n4.c, dft/simd/t2f.c, dft/simd/t2f.h, dft/simd/t3f.h, dft/simd/t4.c, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml, genfft/annotate.ml, genfft/gen_notw.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/simd.ml, genfft/simdmagic.ml, kernel/alloc.c, kernel/ifftw.h, simd/simd-sse.h, simd/simd-sse2.h: More simd codelets 2002-07-02 Matteo Frigo * dft/rank-geq2.c: Oops * dft/rank-geq2.c, dft/vrank-geq1.c: Fixed classic mode * genfft/simd.ml, simd/simd-altivec.h, simd/simd-sse.h, simd/simd-sse2.h: Use LDK for constants so that we can play games. * dft/codelet.h, dft/codelets/n.c, dft/simd/n4.c, dft/simd/t4.c, genfft-k7/gen_notw.ml, genfft/gen_notw.ml, genfft/genutil.ml, genfft/simd.ml, genfft/simd.mli, simd/simd-sse.h: Improved support for fixed strides * dft/codelet.h, dft/codelets/n.c, dft/codelets/n.h, dft/codelets/t.c, dft/codelets/t.h, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/direct.c, dft/k7/k7.c, dft/simd/n4.c, dft/simd/n4.h, dft/simd/t4.c, dft/simd/t4.h, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml, genfft/gen_notw.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml: Changed accounting of flops * genfft-k7/algsimp.ml, genfft-k7/to_alist.ml, genfft/algsimp.ml, simd/simd-sse2.h: Wrong code in non-fma mode * genfft/simdmagic.ml, kernel/alloc.c, simd/Makefile.am, simd/simd-sse2.h, simd/sse2.c: sse2 stuff 2002-07-01 Matteo Frigo * Makefile.am, dft/ct.c, dft/direct.c, dft/k7/k7.c, dft/simd/n4.c, dft/simd/t4.c, kernel/alloc.c, simd/Makefile.am, simd/simd-altivec.h, simd/simd-sse.h, simd/sse.c: Identify CPUs for special codelets * libbench/problem.c: Change split problem syntax * dft/simd/codelets/Makefile.am: Removed -fma flag * simd/simd-altivec.h: Work around gcc bug 2002-06-30 Matteo Frigo * genfft/algsimp.ml, genfft/magic.ml, genfft/oracle.ml, genfft/simd.ml, genfft/to_alist.ml: New simd stuff * dft/simd/codelets/Makefile.am, simd/Makefile.am, simd/simd-altivec.h, simd/simd-sse.h: Added altivec support * dft/simd/t4.c: Forgot file * Makefile.am, configure.ac, dft/Makefile.am, dft/codelet-k7.h, dft/codelet.h, dft/codelets/Makefile.am, dft/codelets/f.h, dft/codelets/inplace/Makefile.am, dft/codelets/n.c, dft/codelets/n.h, dft/codelets/q.h, dft/codelets/standard/Makefile.am, dft/codelets/t.c, dft/codelets/t.h, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/direct.c, dft/k7/Makefile.am, dft/k7/codelets/Makefile.am, dft/k7/ct-dif.c, dft/k7/ct-dit.c, dft/k7/ct-ditbuf.c, dft/k7/direct.c, dft/k7/k7.c, dft/k7/kdft-dif.c, dft/k7/kdft-dit.c, dft/k7/kdft.c, dft/simd/Makefile.am, dft/simd/NAMING, dft/simd/codelets/Makefile.am, dft/simd/n4.c, dft/simd/n4.h, dft/simd/t4.h, genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml, genfft/annotate.ml, genfft/c.ml, genfft/c.mli, genfft/gen_notw.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/magic.ml, genfft/simd.ml, genfft/simd.mli, genfft/simdmagic.ml, genfft/twiddle.ml, genfft/twiddle.mli, kernel/alloc.c, kernel/ifftw.h, libbench/bench-user.h, libbench/problem.c, libbench/util.c, simd/Makefile.am, simd/README, simd/simd-sse.h, simd/simd.h, support/Makefile.codelets, tests/Makefile.am, tests/bench.c: Progress towards simd implementation 2002-06-26 Matteo Frigo * dft/k7/codelets/Makefile.am: Add 128- codelet 2002-06-23 Matteo Frigo * configure.ac, genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml, genfft/c.ml, genfft/c.mli, genfft/expr.ml, genfft/expr.mli, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_r2hc.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/simd.ml, genfft/simd.mli, libbench/bench-main.c: More simd changes. Ensure proper stack alignment in k7 codelets. 2002-06-22 Matteo Frigo * kernel/ifftw.h, kernel/solvtab.c: Fixed prototypes * kernel/cycle.h: Sparc cycle counter requires v9 * configure.ac, kernel/cycle.h, kernel/ifftw.h: Minor fixes * acinclude.m4: Fixed ev67 detection * tests/bench.c: Print flops * genfft/simd.ml: Nothing really * dft/codelet-k7.h, dft/codelet.h, genfft-k7/magic.ml, genfft-k7/to_alist.ml, genfft-k7/to_alist.mli, genfft/Makefile.am, genfft/c.ml, genfft/c.mli, genfft/gen_conv.ml, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_r2hc.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/magic.ml, genfft/simd.ml, genfft/simd.mli, genfft/simdmagic.ml, genfft/to_alist.ml, kernel/ifftw.h: More simd work 2002-06-21 Matteo Frigo * genfft/Makefile.am, genfft/annotate.ml, genfft/annotate.mli, genfft/magic.ml, genfft/simd.ml, genfft/simd.mli, genfft/simdmagic.ml: More simd work 2002-06-20 Matteo Frigo * genfft/Makefile.am, genfft/annotate.ml, genfft/c.ml, genfft/c.mli, genfft/magic.ml, genfft/simd.ml, genfft/variable.ml, genfft/variable.mli: More simd work * genfft/annotate.ml, genfft/annotate.mli, genfft/gen_athnotw.ml, genfft/gen_athtw.ml, genfft/gen_conv.ml, genfft/gen_hc2r.ml, genfft/gen_notw.ml, genfft/gen_r2hc.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/magic.ml, genfft/twiddle.ml, genfft/twiddle.mli, genfft/variable.ml, genfft/variable.mli: Moving towards incorporation of simd stuff 2002-06-19 Matteo Frigo * Makefile.am, configure.ac, dft/Makefile.am, dft/codelets/Makefile.am, dft/ct-dif-k7.c, dft/ct-dit-k7.c, dft/ct-ditbuf-k7.c, dft/direct-k7.c, dft/k7/Makefile.am, dft/k7/codelets/Makefile.am, dft/k7/ct-dif.c, dft/k7/ct-dit.c, dft/k7/ct-ditbuf.c, dft/k7/direct.c, dft/k7/kdft-dif.c, dft/k7/kdft-dit.c, dft/k7/kdft.c, dft/kdft-dif-k7.c, dft/kdft-dit-k7.c, dft/kdft-k7.c: Reorganized k7 stuff into own directory * genfft-k7/expr.ml, genfft-k7/expr.mli, genfft/expr.ml, genfft/expr.mli, genfft/genutil.ml, genfft/magic.ml: Minor experimental stuff * genfft/expr.ml, genfft/expr.mli, genfft/genutil.ml: Cosmetic changes 2002-06-19 FFTW Maintainers * dft/buffered.c, dft/rader.c: allocate buffers on the fly 2002-06-18 Matteo Frigo * dft/Makefile.am, dft/codelet-k7.h, dft/ct-dif-k7.c, dft/ct-dif.c, dft/ct-dit-k7.c, dft/ct-dit.c, dft/ct-ditbuf-k7.c, dft/ct-ditbuf.c, dft/ct.c, dft/ct.h, dft/kdft-dit-k7.c, dft/rader.c, genfft-k7/Makefile.am, genfft-k7/assignmentsToVfpinstrs.ml, genfft-k7/gen_twiddle.ml, genfft-k7/k7Basics.ml, genfft-k7/k7Basics.mli, genfft-k7/k7RegisterAllocationBasics.ml, genfft-k7/k7RegisterAllocationBasics.mli, genfft-k7/k7RegisterAllocator.ml, genfft-k7/k7RegisterAllocatorInit.ml, genfft-k7/number.ml, genfft-k7/to_alist.ml, genfft/number.ml, genfft/to_alist.ml, kernel/ifftw.h, kernel/planner.c, kernel/primes.c: Added ct-ditbuf-k7.c . Major changes required in generator. * genfft-k7/gen_twiddle.ml, kernel/ifftw.h: Nothing, really * configure.ac: !SINGLE ==> !K7_MODE (for some reason the contrapositive sounds wrong) * dft/buffered.c: Buffer is now symmetric wrt forward/backward transform * dft/ct-dif.c, dft/indirect.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/Makefile.am, kernel/debug.c, kernel/ifftw.h, kernel/print.c: Fixed applicable() in indirect.c * dft/rader.c: Fixed attempt to free() uninitialized pointer. * CONVENTIONS, TODO, dft/rader.c, kernel/ifftw.h, kernel/plan.c, kernel/planner.c, tests/bench.c: Added reference counts for awake() 2002-06-18 Steven G. Johnson * dft/rader.c: updated comment * TODO: slight update 2002-06-17 FFTW Maintainers * dft/rader.c, kernel/Makefile.am, kernel/ifftw.h, kernel/primes.c: moved prime-number stuff into primes.c, so it can be shared with generic codelet and with rfftw rader * dft/rader.c: added comment * dft/rader.c, kernel/ifftw.h, kernel/twiddle.c: added rader-dit * configure.ac, dft/Makefile.am, dft/conf.c, dft/dft.h, dft/rader.c: added initial Rader (no DIT yet) * acinclude.m4: don't warn about long long 2002-06-17 Matteo Frigo * dft/Makefile.am, dft/codelet-k7.h, dft/ct-dif-k7.c, dft/ct.h, dft/kdft-dif-k7.c, genfft-k7/gen_twiddle.ml, kernel/planner.c, tests/bench.c: Added k7 DIF codelets 2002-06-16 Matteo Frigo * TODO: Added stuff to do * dft/rank0.c: Handle dual case R = I + 1 * bootstrap.sh: Removed useless flag * mkdist.sh: Removed useless file * CLASSIC-MODE, Makefile.am, configure.ac, dft/Makefile.am, dft/buffered.c, dft/codelet-k7.h, dft/codelet.h, dft/codelets/inplace/Makefile.am, dft/ct-dif.c, dft/ct-dit-k7.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.h, dft/direct-k7.c, dft/direct.c, dft/indirect.c, dft/kdft-dit-k7.c, dft/kdft-dit.c, dft/nop.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, genfft-k7/Makefile.am, genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml, genfft-k7/twiddle.ml, genfft-k7/twiddle.mli, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/twiddle.ml, genfft/twiddle.mli, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/version.c, support/Makefile.codelets, tests/bench.c: More k7 work. Switched to runtime CLASSIC mode. 2002-06-16 Steven G. Johnson * kernel/tensor.c: spelling 2002-06-16 Matteo Frigo * dft/kdft-k7.c: Do not compile if not K7_MODE * dft/codelet-k7.h, dft/dft.h: Do not require K7 definitions to compile * dft/Makefile.am, dft/codelet-k7.h, dft/codelet.h, dft/direct-k7.c, genfft-k7/Makefile.am, genfft-k7/complex.ml, genfft-k7/complex.mli, genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_twiddle.ml: More k7 stuff 2002-06-15 Matteo Frigo * acinclude.m4: Try to be compatible with automake-1.6 * acinclude.m4, configure.ac, genfft-k7/Makefile.am, genfft-k7/algsimp.ml, genfft-k7/algsimp.mli, genfft-k7/assignmentsToVfpinstrs.ml, genfft-k7/assoctable.ml, genfft-k7/assoctable.mli, genfft-k7/complex.ml, genfft-k7/complex.mli, genfft-k7/expr.ml, genfft-k7/expr.mli, genfft-k7/exprdag.ml, genfft-k7/exprdag.mli, genfft-k7/genUtil.ml, genfft-k7/gen_notw.ml, genfft-k7/k7Unparsing.ml, genfft-k7/littlesimp.ml, genfft-k7/littlesimp.mli, genfft-k7/magic.ml, genfft-k7/monads.ml, genfft-k7/number.ml, genfft-k7/number.mli, genfft-k7/oracle.ml, genfft-k7/oracle.mli, genfft-k7/to_alist.ml, genfft-k7/to_alist.mli, genfft-k7/twiddle.ml, genfft-k7/twiddle.mli, genfft-k7/util.ml, genfft-k7/util.mli, genfft-k7/vFpUnparsing.ml, genfft-k7/vSimdBasics.ml, genfft-k7/vSimdUnparsing.ml, genfft-k7/variable.ml, genfft-k7/variable.mli, genfft/number.ml, support/Makefile.am, support/Makefile.codelets, support/codelet_asmprelude: More merging of Stefan's generator with main genfft branch * genfft-k7/Makefile.am, genfft-k7/complex.ml, genfft-k7/complex.mli, genfft-k7/expr.ml, genfft-k7/expr.mli, genfft-k7/exprdag.ml, genfft-k7/exprdag.mli, genfft-k7/fft.ml, genfft-k7/fft.mli, genfft-k7/genUtil.ml, genfft-k7/gen_hc2hc.ml, genfft-k7/gen_hc2real.ml, genfft-k7/gen_notw.ml, genfft-k7/gen_notwiddle.ml, genfft-k7/gen_notwiddle_fixedstride.ml, genfft-k7/gen_real2hc.ml, genfft-k7/gen_realeven.ml, genfft-k7/gen_realeven2.ml, genfft-k7/gen_realodd.ml, genfft-k7/gen_realodd2.ml, genfft-k7/gen_twiddle.ml, genfft-k7/magic.ml, genfft-k7/symmetry.ml, genfft-k7/twiddle.ml, genfft-k7/util.ml, genfft-k7/util.mli, genfft-k7/variable.ml, genfft-k7/variable.mli, genfft/expr.ml, genfft/expr.mli, genfft/genutil.ml, support/Makefile.codelets: Slowly merging genfft-k7 with main genfft branch * genfft-k7/Makefile.am, genfft-k7/genUtil.ml, genfft-k7/magic.ml, genfft-k7/magic.mli, genfft-k7/twiddle.ml, support/Makefile.codelets: Fixed, really * support/Makefile.codelets: Oops... * support/Makefile.codelets: Work properly when $(ALL_CODELETS) = "" * Makefile.am, configure.ac, dft/codelet.h, dft/codelets/Makefile.am, dft/conf.c, genfft-k7/gen_notwiddle.ml, kernel/ifftw.h, support/Makefile.codelets: Fixed k7 build machinery 2002-06-14 Matteo Frigo * Makefile.am, configure.ac, dft/codelet.h, dft/direct-k7.c, genfft-k7/Makefile.am, genfft-k7/codeletMisc.ml, genfft-k7/codeletMisc.mli, genfft-k7/genUtil.ml, genfft-k7/genUtil.mli, genfft-k7/gen_hc2hc.mli, genfft-k7/gen_hc2real.mli, genfft-k7/gen_notwiddle.ml, genfft-k7/gen_notwiddle.mli, genfft-k7/gen_real2hc.mli, genfft-k7/gen_realeven.mli, genfft-k7/gen_realeven2.mli, genfft-k7/gen_realodd.mli, genfft-k7/gen_realodd2.mli, genfft-k7/gen_twiddle.mli, genfft-k7/genfft.ml, genfft-k7/k7Basics.ml, genfft-k7/k7Basics.mli, genfft-k7/k7Unparsing.ml, genfft-k7/magic.ml, genfft-k7/magic.mli, kernel/ifftw.h, libbench/bench-user.h, support/Makefile.am, dft/Makefile.am, dft/conf.c, dft/dft.h, dft/kdft-k7.c, support/Makefile.codelets: More work on k7 stuff * dft/codelet.h, dft/direct.c, genfft/gen_notw.ml: Changed my mind again * genfft-k7/gen_notwiddle.ml: Removed some useless stuff. * genfft-k7/gen_notwiddle.ml: Hmm... * dft/codelet.h, dft/direct.c, genfft-k7/gen_notwiddle.ml, genfft/gen_notw.ml: More work in preparation for k7 stuff * TODO, dft/codelet.h, dft/direct.c, genfft/gen_notw.ml: Still preparing to include k7 stuff * bootstrap.sh: Create .depend * Makefile.am, bootstrap.sh, configure.ac, genfft-k7/Makefile.am, genfft-k7/assignmentsToVfpinstrs.ml, genfft-k7/assignmentsToVfpinstrs.mli, genfft-k7/balanceVfpinstrs.ml, genfft-k7/balanceVfpinstrs.mli, genfft-k7/codeletMisc.ml, genfft-k7/codeletMisc.mli, genfft-k7/complex.ml, genfft-k7/complex.mli, genfft-k7/expr.ml, genfft-k7/expr.mli, genfft-k7/exprdag.ml, genfft-k7/exprdag.mli, genfft-k7/fft.ml, genfft-k7/fft.mli, genfft-k7/genUtil.ml, genfft-k7/genUtil.mli, genfft-k7/gen_hc2hc.ml, genfft-k7/gen_hc2hc.mli, genfft-k7/gen_hc2real.ml, genfft-k7/gen_hc2real.mli, genfft-k7/gen_notwiddle.ml, genfft-k7/gen_notwiddle.mli, genfft-k7/gen_notwiddle_fixedstride.ml, genfft-k7/gen_real2hc.ml, genfft-k7/gen_real2hc.mli, genfft-k7/gen_realeven.ml, genfft-k7/gen_realeven.mli, genfft-k7/gen_realeven2.ml, genfft-k7/gen_realeven2.mli, genfft-k7/gen_realodd.ml, genfft-k7/gen_realodd.mli, genfft-k7/gen_realodd2.ml, genfft-k7/gen_realodd2.mli, genfft-k7/gen_twiddle.ml, genfft-k7/gen_twiddle.mli, genfft-k7/genfft.ml, genfft-k7/id.ml, genfft-k7/id.mli, genfft-k7/k7Basics.ml, genfft-k7/k7Basics.mli, genfft-k7/k7ExecutionModel.ml, genfft-k7/k7ExecutionModel.mli, genfft-k7/k7FlatInstructionScheduling.ml, genfft-k7/k7FlatInstructionScheduling.mli, genfft-k7/k7InstructionSchedulingBasics.ml, genfft-k7/k7InstructionSchedulingBasics.mli, genfft-k7/k7RegisterAllocationBasics.ml, genfft-k7/k7RegisterAllocationBasics.mli, genfft-k7/k7RegisterAllocator.ml, genfft-k7/k7RegisterAllocator.mli, genfft-k7/k7RegisterAllocatorEATranslation.ml, genfft-k7/k7RegisterAllocatorEATranslation.mli, genfft-k7/k7RegisterAllocatorInit.ml, genfft-k7/k7RegisterAllocatorInit.mli, genfft-k7/k7RegisterReallocation.ml, genfft-k7/k7RegisterReallocation.mli, genfft-k7/k7Translate.ml, genfft-k7/k7Translate.mli, genfft-k7/k7Unparsing.ml, genfft-k7/k7Unparsing.mli, genfft-k7/k7Vectorization.ml, genfft-k7/k7Vectorization.mli, genfft-k7/magic.ml, genfft-k7/magic.mli, genfft-k7/memoMonad.ml, genfft-k7/memoMonad.mli, genfft-k7/nonDetMonad.ml, genfft-k7/nonDetMonad.mli, genfft-k7/nullVectorization.ml, genfft-k7/nullVectorization.mli, genfft-k7/number.ml, genfft-k7/number.mli, genfft-k7/stateMonad.ml, genfft-k7/stateMonad.mli, genfft-k7/symmetry.ml, genfft-k7/twiddle.ml, genfft-k7/util.ml, genfft-k7/util.mli, genfft-k7/vAnnotatedScheduler.ml, genfft-k7/vAnnotatedScheduler.mli, genfft-k7/vDag.ml, genfft-k7/vDag.mli, genfft-k7/vFpBasics.ml, genfft-k7/vFpBasics.mli, genfft-k7/vFpUnparsing.ml, genfft-k7/vFpUnparsing.mli, genfft-k7/vImproveSchedule.ml, genfft-k7/vImproveSchedule.mli, genfft-k7/vK7Optimization.ml, genfft-k7/vK7Optimization.mli, genfft-k7/vScheduler.ml, genfft-k7/vScheduler.mli, genfft-k7/vSimdBasics.ml, genfft-k7/vSimdBasics.mli, genfft-k7/vSimdIndexing.ml, genfft-k7/vSimdIndexing.mli, genfft-k7/vSimdUnparsing.ml, genfft-k7/vSimdUnparsing.mli, genfft-k7/variable.ml, genfft-k7/variable.mli, AUTHORS: Imported Stefan's K7 generator 2002-06-13 Matteo Frigo * genfft/Makefile.am, genfft/c.ml, genfft/complex.ml, genfft/complex.mli, genfft/gen_hc2r.ml, genfft/gen_r2hc.ml, genfft/genutil.ml, genfft/trig.ml: Generator for real->halfcomplex and halfcomplex->real codelets * dft/problem.c, kernel/planner.c, kernel/tensor.c, tests/bench.c: Improved hash functions, printers * support/Makefile.codelets: Only regenerate codlist.c in maintainer mode * dft/problem.c, dft/rank-geq2.c, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/print.c, kernel/tensor.c, tests/bench.c: Planner can export solution list * dft/ct-ditbuf.c, dft/dft.h, dft/direct.c, kernel/cycle.h, libbench/bench-user.h: Fixed for intel compiler * dft/codelet.h, genfft/c.ml, genfft/c.mli, genfft/gen_notw.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/magic.ml: Revised strategy for constants in codelets * tests/bench.c: Enable score planner in classic mode, naive planner in pro mode. 2002-06-12 Matteo Frigo * tests/bench.c: Report classic/pro * dft/buffered.c, tests/bench.c: Fixed behavior of buffered solver for large buffers. * dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/timer.c, libbench/timer.c, tests/bench.c: Make assumption COST(vector) = length * COST(scalar) in classic mode. * kernel/ifftw.h, kernel/plan.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, support/Makefile.codelets: Revised planner implementation in preparation for wisdom. * dft/ct-ditbuf.c: Manually hoist loop invariants. * dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c: Revised loop to compile better with gcc -O 2002-06-11 Matteo Frigo * kernel/tensor.c: Changed tensor syntax * TODO: Added stuff to do. * kernel/version.c: Report classic/pro in version number * CLASSIC-MODE, Makefile.am, RESEARCH-MODE, bootstrap.sh, configure.ac, dft/codelets/inplace/Makefile.am, dft/ct-dit.c, dft/ct-ditbuf.c, dft/kdft-dit.c, dft/rank-geq2.c, dft/vrank-geq1.c, kernel/ifftw.h, mkdist.sh, tests/bench.c: Renamed versions into classic/pro * kernel/Makefile.am, kernel/ifftw.h, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, tests/bench.c: Revised planners, estimator * Makefile.am, dft/buffered.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, kernel/ifftw.h: I don't know what I am doing. * Makefile.am, dft/buffered.c, dft/codelet.h, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct-ditf.c, dft/ct.c, dft/dft.h, dft/direct.c, dft/indirect.c, dft/nop.c, dft/rank-geq2.c, dft/rank0.c, dft/vrank-geq1.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, genfft/c.ml, kernel/Makefile.am, kernel/cycle.h, kernel/flops.c, kernel/ifftw.h, kernel/ops.c, kernel/plan.c, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/planner-score.c, tests/bench.c: Massive revision of estimator * dft/Makefile.am, dft/buffered.c, dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/conf.c, dft/ct-dit.c, dft/ct.c, dft/dft.h, dft/indirect.c, dft/problem.c, dft/rank-geq2.c, dft/vecloop.c, dft/vrank-geq1.c, kernel/ifftw.h, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/print.c, kernel/tensor.c, kernel/timer.c, libbench/bench-main.c, tests/bench.c: Many changes * dft/ct-ditbuf.c: Keep it simple, stupid. 2002-06-10 Matteo Frigo * kernel/ifftw.h: Fixed when #undef PRECOMPUTE_ARRAY_INDICES * dft/vrank3-transpose.c, kernel/print.c: Minor changes * CONVENTIONS, configure.ac, dft/Makefile.am, dft/buffered.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditbuf.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/direct.c, dft/indirect.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, dft/kdft.c, dft/nop.c, dft/rank-geq2.c, dft/rank0.c, dft/vecloop.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/ifftw.h, kernel/plan.c, kernel/problem.c, kernel/timer.c, tests/bench.c: Added ct-ditbuf.c, many changes everywhere * kernel/ifftw.h, kernel/planner.c, tests/bench.c: More name mangling * Makefile.am, acinclude.m4, configure.ac, tests/Makefile.am: Fixed build system for single/double precision * CONVENTIONS, configure.ac, dft/buffered.c, dft/codelet.h, dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditf.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/direct.c, dft/indirect.c, dft/kdft-dif.c, dft/kdft-difsq.c, dft/kdft-dit.c, dft/kdft.c, dft/nop.c, dft/plan.c, dft/problem.c, dft/rank-geq2.c, dft/rank0.c, dft/solve.c, dft/vecloop.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, genfft/gen_notw.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, kernel/Makefile.am, kernel/alloc.c, kernel/assert.c, kernel/awake.c, kernel/cycle.h, kernel/dfftw3.h, kernel/fftw.h, kernel/fftw3.h, kernel/flops.c, kernel/ifftw.h, kernel/minmax.c, kernel/plan.c, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/planner-score.c, kernel/planner.c, kernel/print.c, kernel/problem.c, kernel/sfftw3.h, kernel/solver.c, kernel/solvtab.c, kernel/square.c, kernel/stride.c, kernel/tensor.c, kernel/timer.c, kernel/twiddle.c, kernel/version.c, support/Makefile.codelets, tests/bench.c: Massive renaming to support both single and double precision. (Must recompile everything twice). * libbench/allocate.c, libbench/bench-user.h, libbench/mflops.c, libbench/problem.c, tests/bench.c: Preliminary crude support for vector transforms in benchmark library. * kernel/tensor.c: Wrong cast 2002-06-09 Matteo Frigo * TODO: Added things to do. * kernel/twiddle.c: twlen0: make static * dft/buffered.c: Nothing * kernel/print.c: Forgot break in switch statement. * kernel/print.c: Fix for c++ compatibility * TODO, dft/buffered.c, dft/ct.c, dft/direct.c, dft/indirect.c, dft/nop.c, dft/rank-geq2.c, dft/rank0.c, dft/vecloop.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, kernel/Makefile.am, kernel/ifftw.h, kernel/planner.c, kernel/print.c, tests/bench.c: Added printer, changed everything * dft/buffered.c, dft/nop.c, dft/rank0.c, tests/bench.c: Removed redundant nop solver * TODO: More things to do * TODO, dft/Makefile.am, dft/buffered.c, dft/conf.c, dft/dft.h, dft/direct.c, dft/indirect.c, dft/nop.c, dft/problem.c, dft/rank0.c, dft/vecloop.c, kernel/ifftw.h, kernel/tensor.c: Introduced idea of rank -infinity and associated NOP plans * dft/buffered.c: Fixed comment * kernel/tensor.c: Removed useless assertions. * kernel/tensor.c: Don't malloc(0). * dft/buffered.c: Fixed signed/unsigned puns * dft/Makefile.am, dft/buffered.c, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditf.c, dft/dft.h, dft/direct.c, dft/indirect.c, dft/rank-geq2.c, dft/rank0.c, dft/vecloop.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, libbench/bench-main.c, libbench/bench-user.h, tests/bench.c: Added buffered.c 2002-06-08 Matteo Frigo * dft/ct.c: Fixed printout * dft/vrank3-transpose.c: Fixed comment * dft/Makefile.am, dft/conf.c, dft/dft.h, dft/vrank0-transpose.c, dft/vrank2-transpose.c, dft/vrank3-transpose.c, tests/bench.c: Added vrank3-transpose, renamed vrank0-transpose -> vrank2-transpose * bootstrap.sh, dft/Makefile.am, dft/conf.c, dft/direct.c, dft/rank-geq2.c, dft/rank0.c, dft/rank_geq2.c, dft/vrank0-transpose.c, tests/bench.c: Added vrank0-transpose * dft/Makefile.am, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditf.c, dft/direct.c, dft/indirect.c, dft/rank0.c, dft/rank_geq2.c, dft/vecloop.c, kernel/Makefile.am, kernel/ifftw.h, kernel/planner-score.c, kernel/planner.c, tests/bench.c: Added planner-score.c * dft/Makefile.am, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditf.c, dft/dft.h, dft/indirect.c, dft/rank_geq2.c, dft/vecloop.c: Added indirect.c * Makefile.am, dft/Makefile.am, dft/codelet.h, dft/codelets/Makefile.am, dft/codelets/inplace/Makefile.am, dft/conf.c, dft/ct-dif.c, dft/ct-dit.c, dft/ct-ditf.c, dft/dft.h, dft/direct.c, dft/kdft-dif.c, dft/kdft-difsq.c, mkdist.sh, tests/Makefile.am: dif, ditf solvers 2002-06-07 Matteo Frigo * Makefile.am, RESEARCH-MODE, bootstrap.sh, configure.ac, dft/Makefile.am, dft/conf.c, dft/dft.h, dft/rank_geq2.c, dft/vecloop.c, kernel/ifftw.h, kernel/minmax.c, kernel/planner.c, support/Makefile.codelets: Implemented rank_geq2. Revised build system * kernel/alloc.c: Fixed printout * Makefile.am, bootstrap.sh, configure.ac, dft/Makefile.am, dft/codelet.h, dft/codelets/Makefile.am, dft/codelets/inplace/Makefile.am, dft/codelets/standard/Makefile.am, dft/conf.c, dft/dft.h, dft/rank0.c, genfft/gen_notw.ml, tests/Makefile.am, tests/bench.c: Added rank0. Revised codelet organization. 2002-06-06 Matteo Frigo * dft/ct.c, dft/vecloop.c, genfft/trig.ml, kernel/ifftw.h, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/planner.c, libbench/bench-user.h, tests/bench.c: Added memoization * dft/Makefile.am, dft/dft.h, dft/direct.c, dft/vecloop.c, kernel/alloc.c, kernel/ifftw.h, kernel/planner.c, tests/bench.c: Added vecloop 2002-06-05 Matteo Frigo * dft/Makefile.am, dft/ct-dit.c, dft/ct.c, dft/ct.h, dft/dft.h, dft/direct.c, dft/kdft-dit.c, kernel/alloc.c, kernel/twiddle.c: First DIT solver/plan * dft/Makefile.am, dft/ct.c, dft/ct.h, kernel/ifftw.h, kernel/stride.c, kernel/twiddle.c: More work on ct * kernel/ifftw.h, kernel/planner-naive.c, kernel/timer.c: Only use cycle counters * CONVENTIONS, bootstrap.sh, kernel/ifftw.h, kernel/twiddle.c: Signed/unsigned fixup * kernel/Makefile.am, kernel/ifftw.h, kernel/twiddle.c: New file twiddle.c 2002-06-04 Matteo Frigo * configure.ac, dft/Makefile.am, dft/direct.c, dft/kdft-dit.c, dft/problem.c, genfft/gen_notw.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, kernel/Makefile.am, kernel/codelet.h, kernel/flops.c, kernel/ifftw.h, kernel/tensor.c, kernel/timer.c, tests/bench.c: Made tensor ranks and vector lengths unsigned. Hopefully fixed all places where it matters. * Makefile.am, configure.ac, dft/Makefile.am, dft/dft.h, dft/direct.c, dft/kdft.c, kernel/Makefile.am, kernel/codelet.h, kernel/fftw.h, kernel/ifftw.h, kernel/planner-estimate.c, kernel/planner-naive.c, kernel/solvtab.c, libbench/Makefile.am, support/Makefile.codelets, tests/Makefile.am, tests/bench.c: System is in working state now (but very incomplete) 2002-06-03 Matteo Frigo * CONVENTIONS, kernel/Makefile.am, kernel/ifftw.h, kernel/planner-naive.c, kernel/planner.c: Started implementing planners * Makefile.am, configure.ac, libbench/Makefile.am, libbench/accopy-from.c, libbench/accopy-to.c, libbench/acopy.c, libbench/allocate.c, libbench/ascale.c, libbench/aset.c, libbench/bench-main.c, libbench/bench-user.h, libbench/bench.h, libbench/caadd.c, libbench/cacopy.c, libbench/can-do.c, libbench/cascale.c, libbench/caset.c, libbench/casub.c, libbench/ccopy-from.c, libbench/ccopy-to.c, libbench/copy-c2c-from.c, libbench/copy-c2c-to.c, libbench/copy-c2h-1d-fftpack.c, libbench/copy-c2h-1d-halfcomplex.c, libbench/copy-c2h-1d-packed.c, libbench/copy-c2h-1d-unpacked-ri.c, libbench/copy-c2h-unpacked.c, libbench/copy-c2h.c, libbench/copy-c2r-packed.c, libbench/copy-c2r-unpacked.c, libbench/copy-c2r.c, libbench/copy-c2ri.c, libbench/copy-h2c-1d-fftpack.c, libbench/copy-h2c-1d-halfcomplex.c, libbench/copy-h2c-1d-packed.c, libbench/copy-h2c-1d-unpacked-ri.c, libbench/copy-h2c-unpacked.c, libbench/copy-h2c.c, libbench/copy-r2c-packed.c, libbench/copy-r2c-unpacked.c, libbench/copy-r2c.c, libbench/copy-ri2c.c, libbench/deallocate.c, libbench/getopt-utils.c, libbench/getopt.c, libbench/getopt.h, libbench/getopt1.c, libbench/info.c, libbench/log2.c, libbench/main.c, libbench/mflops.c, libbench/ovtpvt.c, libbench/pow2.c, libbench/prime.c, libbench/problem.c, libbench/report.c, libbench/speed.c, libbench/timer.c, libbench/unnormalize.c, libbench/util.c, libbench/verify.c, libbench/zero.c: Imported libbench from the new benchfft. We will use libbench for benchmarking and testing. * kernel/Makefile.am, kernel/ifftw.h, kernel/rand.c, kernel/timer.c: Removed useless rand.c * CONVENTIONS, dft/problem.c, kernel/Makefile.am, kernel/alloc.c, kernel/cycle.h, kernel/ifftw.h, kernel/plan.c, kernel/timer.c: Added timer * configure.ac: Split codelets into standard and inplace 2002-06-02 Matteo Frigo * CONVENTIONS, Makefile.am, dft/Makefile.am, dft/dft.h, dft/direct.c, dft/plan.c, dft/problem.c, dft/solve.c, kernel/Makefile.am, kernel/awake.c, kernel/ifftw.h, kernel/square.c: Many many changes * kernel/codelet.h: Fixed anachronism * kernel/assert.c, kernel/codelet.h, kernel/fftw.h, kernel/flops.c, kernel/ifftw.h, kernel/minmax.c, kernel/plan.c, kernel/problem.c, kernel/rand.c, kernel/solver.c, kernel/stride.c, kernel/tensor.c, kernel/version.c, support/Makefile.am, support/Makefile.codelets, support/codelet_prelude: Initial import * kernel/assert.c, kernel/codelet.h, kernel/fftw.h, kernel/flops.c, kernel/ifftw.h, kernel/minmax.c, kernel/plan.c, kernel/problem.c, kernel/rand.c, kernel/solver.c, kernel/stride.c, kernel/tensor.c, kernel/version.c, support/Makefile.am, support/Makefile.codelets, support/codelet_prelude: New file. * genfft/to_alist.ml, genfft/to_alist.mli, genfft/trig.ml, genfft/trig.mli, genfft/twiddle.ml, genfft/twiddle.mli, genfft/unique.ml, genfft/unique.mli, genfft/util.ml, genfft/util.mli, genfft/variable.ml, genfft/variable.mli, kernel/Makefile.am, kernel/alloc.c: Initial import * genfft/to_alist.ml, genfft/to_alist.mli, genfft/trig.ml, genfft/trig.mli, genfft/twiddle.ml, genfft/twiddle.mli, genfft/unique.ml, genfft/unique.mli, genfft/util.ml, genfft/util.mli, genfft/variable.ml, genfft/variable.mli, kernel/Makefile.am, kernel/alloc.c: New file. * genfft/littlesimp.mli, genfft/magic.ml, genfft/monads.ml, genfft/number.ml, genfft/number.mli, genfft/oracle.ml, genfft/oracle.mli, genfft/schedule.ml, genfft/schedule.mli: Initial import * genfft/littlesimp.mli, genfft/magic.ml, genfft/monads.ml, genfft/number.ml, genfft/number.mli, genfft/oracle.ml, genfft/oracle.mli, genfft/schedule.ml, genfft/schedule.mli: New file. * genfft/gen_athtw.ml, genfft/gen_conv.ml, genfft/gen_notw.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/littlesimp.ml: Initial import * genfft/gen_athtw.ml, genfft/gen_conv.ml, genfft/gen_notw.ml, genfft/gen_trig.ml, genfft/gen_twiddle.ml, genfft/gen_twidsq.ml, genfft/genutil.ml, genfft/littlesimp.ml: New file. * genfft/c.ml, genfft/c.mli, genfft/complex.ml, genfft/complex.mli, genfft/conv.ml, genfft/conv.mli, genfft/dag.ml, genfft/dag.mli, genfft/expr.ml, genfft/expr.mli, genfft/fft.ml, genfft/fft.mli, genfft/gen_athnotw.ml: Initial import * genfft/c.ml, genfft/c.mli, genfft/complex.ml, genfft/complex.mli, genfft/conv.ml, genfft/conv.mli, genfft/dag.ml, genfft/dag.mli, genfft/expr.ml, genfft/expr.mli, genfft/fft.ml, genfft/fft.mli, genfft/gen_athnotw.ml: New file. * genfft/algsimp.ml, genfft/algsimp.mli, genfft/annotate.ml, genfft/annotate.mli, genfft/assoctable.ml, genfft/assoctable.mli: Initial import * genfft/algsimp.ml, genfft/algsimp.mli, genfft/annotate.ml, genfft/annotate.mli, genfft/assoctable.ml, genfft/assoctable.mli: New file. * acinclude.m4, bootstrap.sh, configure.ac, dft/Makefile.am, dft/dft.h, dft/problem.c, genfft/Makefile.am: Initial import * acinclude.m4, bootstrap.sh, configure.ac, dft/Makefile.am, dft/dft.h, dft/problem.c, genfft/Makefile.am: New file. * AUTHORS, COPYRIGHT, ChangeLog, Makefile.am, NEWS, README: Initial import * AUTHORS, COPYRIGHT, ChangeLog, Makefile.am, NEWS, README: New file.