Release Notes

From Jacket Wiki

Jump to: navigation, search
Back to Documentation

Contents

Nightly Build

See Nightly

Version 2.0

A GPU Mandelbrot Set, generated using ARRAYFUN
Genomics - Multidimensional Scaling for analysis of Crop Gene Expression Patterns
Graphics library: simulating shallow water equations with reflective boundary conditions
Graphics library: image registration visualizing all flow gradients as scatter plot
  • Released 16 November, 2011
  • Requires CUDA 4.0 drivers: version 275 or higher on Windows, Linux, and OSX
  • Additional examples in jacket/examples/:
    • JacketDemo - runs a subset of the included examples as side-by-side CPU/GPU comparison
    • black_scholes_example and black_scholes_mgl - Black-Scholes option pricing using all available GPUs
    • brainseg - maximum-likelihood image segmentation of an MRI
    • genomics_mds - High-Throughput Dimensional Scaling (HiT-MDS) to analyze genomic data
    • heart - real-time volume filtering of a heart MRI
    • mandelbrot - Real-time, large domain Mandelbrot image construction
    • music_visualizer - Signal analysis and music beat visualization (see more)
    • volume - real-time volume rendering of the brain

Version 1.8

  • Requires CUDA 4.0 drivers:
  • Built against the CUDA 4.0 Toolkit. We package the CUDA libraries, so users are not required to install this toolkit.

09/29/2011 -v1.8.2

  • New features
    • GINFO sorts devices by theoretical FLOPS (forums)
    • GSELECT between devices any time (not just at startup)
  • Improvements
    • IFFT, IFFT2, IFFTN up to 10 percent faster
    • More specific licensing error messages for various cases.
    • Switch GRAND,GRANDN to use NVIDIA's CURAND library (3x or more speedup)
  • Changes
    • CUMSUM bug fixed inside GFOR (forums)
    • Memory leak inside Reductions fixed (SUM, PROD, MIN, MAX)
    • BWDIST now works properly at larger sizes (forums)
    • MPOWER bug for A^k where k=13
    • Fixed bug for too many file handles being opened
    • Fixed complex imprecision issue for mrdivide and rdivide
    • Fixed an "Invalid MEX File" Error occurring with Jacket 1.8.1 (forums)

08/22/2011 - v1.8.1

  • New features
    • BWLABEL
    • REGIONPROPS
    • New and Improved "Jacket Demo" (see examples/JacketDemo)
    • New Music Visualizer example (described here)
    • New Optical Flow Tracking example (see examples/oflow_example)
    • New Black-Scholes example using GCOMPILE examples/black_scholes_example. ~35X faster than CPU.
  • Improvements
    • Issue with Jacket over Remote Desktop reported on the forums fixed
      • The issue also was preventing GSELECT from functioning properly on some Windows machines.
    • Jacket now gives LM_LICENSE_FILE and LM_LICENSE_PATH precedence over engine/jlicense.dat (When both exist).
    • TIMEIT accepts new calling syntaxes
    • MLDIVIDE is faster for small over-determined systems inside GFOR. (Number of unknowns <= 32).
    • IMHIST generally faster for compute capability >= 1.2
  • API Changes
    • MTIMES returns GPU scalars even if the inputs are vectors.

07/18/2011 - v1.8

  • New Subscripting Features(SUBSREF,SUBSASGN)
    • Double-precision indexing (forum posts 1, 2)
    • Removing items: A(index)=[] (discrete, sorted idx)
    • 5x faster A(boolean_mask)=scalar
  • Mac OSX only supports 64-bit Snow Leopard 10.6.3 or later (discontinued Leopard 10.5 and 32-bit)
  • New functions:
  • Improvements:
  • Examples have been overhauled:
    • Updated so CPU and GPU use the same code (see "Write once, run everywhere")
    • New mandelbrot_example: Using ARRAYFUN to generate a Mandelbrot set - gives typically a 1000x speedup
    • New brainseg example: Segmentation of cranium scan
    • New genomics_mds_example: Multidimensional Scaling for Genomic analysis
    • For a complete description of the current examples: Examples
  • API Changes:
    • SUM, PROD, MIN, MAX, ANY, ALL on vector inputs now return GPU scalars instead of CPU scalars.
  • Graphics:
    • PLOT, PLOT3 now accept linestyle string parameter to set color
    • axis rescaling and panning now corrected so that ticks always visible
    • Added GCOLORMAP, GTITLE, GDRAWNOW
    • Mac OSX support
    • Improved Windows Performance
  • DISPLAY, DISP view contents of each GFOR tile instead of only the first
  • Sparse matrix support for BICGSTAB; support for older versions of MATLAB
  • In many cases, pre-allocation before subscript assignment is no longer needed

Example:

% v1.7
A = gones(n,n);
gfor ii = 1:n
  A(ii,:) = gones(1,n);
gend
 
% v1.8 (no need to preallocate A)
gfor ii = 1:n
  A(ii,:) = gones(1,n);
gend

Version 1.7

New graphics library: image registration using classic Horn-Schunck, Gauss-Newton
New graphics library: debugging image registration code
New graphics library: Simple Example
New graphics library: Tweaked FDTD
  • Requires CUDA 3.2 drivers:
  • Built against the CUDA 3.2 Toolkit. We package the CUDA libraries, so users are not required to install this toolkit.

03/21/2011 - v1.7.1

Enhancements:

  • CUMSUM handles matrices up to 4 dimensions.
  • Cleaned up several examples to use new CLASS syntax to avoid separate CPU and GPU code (see blog)
    • pi, fft, blas, fdtd
  • GFOR-enabled MPOWER

Bug fixes:

  • CULA initialization error on Quadro cards
  • Mac load error for libkublas
  • GRANDN single-precision was producing NaN values
  • Issues with IMHIST for large matrix sizes
  • MTIMES has been fixed for sparse matrix * dense matrix multiplication
  • INTERP1 now works properly for yi = interp1(x,Y,xi,method)*

02/26/2011 - v1.7

New graphics library visualizing FDTD simulation
  • Enhanced convolutions: CONV, CONV2, CONVN
    • No kernel size limitation
    • Supporting double precision and complex numbers
    • Full support with GFOR
  • Graphics Library Refresh
    • Optimized for minimum compute thread impact both in MATLAB and in standalone C++ apps
    • Axis tick marking, automatic scaling of plot data, new slick look
    • Supported primitives: SURF, PLOT, IMAGESC, SCATTER3
      • sophisticated syntaxes on the way (e.g. plot linestyle, marker shape, marker size)
    • New (updated) functions: GHOLD, GSUBPLOT, GCLF, GDRAWNOW (GFIGURE, GCLOSE)
    • IMAGESC performance greatly improved
    • Jacket OpenGL context creation only on demand (compatibility with applications which require their own GL contexts)
    • Drastically reduced disk, memory, and processor footprint
    • CUDA independent graphics mode
  • Improvements
    • ONES, ZEROS, RAND, RANDN, INF, NAN, etc. now available via new usage of CLASS
    • New functions GINF and GNAN added
    • TIMEIT provides robust estimates of both CPU and GPU code snippets (see demos/blas_example/)
    • Support for Remote Desktop (see blog post)
    • New GCOMPILE features:
      • BREAK, RETURN, and CONTINUE
      • Additional EPS syntax: eps('single') and eps('double')
      • Expanded FOR loop support - Instead of only hard-coded literal values, any COLON expression may be used.
  • Bug Fixes
    • GPROFILE
      • now deals correctly with GRANDN, non .m files, mixed precision, NAN
    • Memory leaks fixed as posted on the (forums)
    • FFT plan caching fix which was affecting some combinations
    • GMEX updates for CUDA 3.2, fixes
    • GCACHE corrections to avoid instability caused by presence of older Jacket cache files and now portable across 32bit and 64bit architectures.
  • Limitations
    • The graphics library is not supported for MAC systems in this release.

Versions from 2010 and earlier

For older versions, visit Release Notes 2010 and Earlier.

Personal tools