wiki:ReplacingGMPNotes/TheCurrentGMPImplementation

Version 10 (modified by p_tanski, 7 years ago) (diff)

add clarification for GMP function allocation from GHC heap

The Current GMP Implementation

Esa Ilari Vuokko, who at one time attempted to replace GMP with LibTomMath, posted several messages with good notes on the current implementation. Much of what is on this page is derived from those notes. See, Replacement for GMP(3) and Replacement for GMP(4).

GMP and the Storage Manager

The most insidious and unique feature of the GHC implementation with GMP is memory management. GHC uses the special memory functions in GMP ( mp_set_memory_functions( (*alloc)(), (*realloc)(), (*dealloc)() ) to let GMP use GHC's own garbage collector. The memory functions are implemented in rts/sm/Storage.c as:

	  static void* stgAllocForGMP (size_t size_in_bytes);
	  static void  stgDeallocForGMP (void *ptr STG_UNUSED, 
		  			 size_t size STG_UNUSED);
	  static void* stgReallocForGMP (void *ptr, 
					 size_t old_size, 
					 size_t new_size);

These special allocation functions bring most GMP memory usage into the GHC heap but do not seem that efficient otherwise. (I could be wrong --PDT.) Allocation uses the internal allocate() interface, so no garbage collection is performed during a GMP operation. Note that GMP operations may use these functions to allocate more memory themselves, after being called from Haskell code. The memory allocated is a simple array of words (W_), rounded up to a whole number.

Special Functions

GHC adds its own functions for string conversion, least common multiple (lcm) and conversion to and from floating point (floats and doubles). In particular, the GHC designers decided to perform their own operations for encoding a GMP number (really, the array of mp_limb_t) to floating point numbers (floats or doubles). GMP provides functions for some of these operations:

Operation GMP function Notes
convert to double mpz_get_d() rounds toward zero, avoids overflow
convert to double
return exponent separately
mpz_get_d_2exp() rounds toward zero, avoids overflow
convert to float mpz_get_f()
convert to string
supports bases from 2 to 36
mpz_get_str() more efficient than Haskell version,
with PackedString, no conversion to Haskell string necessary
conversion from string mpz_inp_str() more efficient than Haskell version,
with PackedString, input string efficiently convertable to a c_str
(it is already an array of chars)

The real problem (no pun intended) is conversion to doubles and floats. The __encodeDouble function, in rts/StgPrimFloat.c, may overflow:

StgDouble
__encodeDouble (I_ size, StgByteArray ba, I_ e) /* result = s * 2^e */
{
    StgDouble r;
    const mp_limb_t *const arr = (const mp_limb_t *)ba;
    I_ i;

    /* Convert MP_INT to a double; knows a lot about internal rep! */
    for(r = 0.0, i = __abs(size)-1; i >= 0; i--)
	r = (r * GMP_BASE) + arr[i];

    /* Now raise to the exponent */
    if ( r != 0.0 ) /* Lennart suggests this avoids a bug in MIPS's ldexp */
	r = ldexp(r, e);

    /* sign is encoded in the size */
    if (size < 0)
	r = -r;

    return r;
}

If the problem section hurt your eyes, here is some more pain to drive the point home:

    /* __abs(size) is the size of the array --PDT */
    for(r = 0.0, i = __abs(size)-1; i >= 0; i--)
	r = (r * GMP_BASE) + arr[i];

Note the problems:

  • GMP numbers may be very large, much greater than the maximum value for a float (addition and multiplication operations may overflow);
  • there is no specification of rounding mode--this operation relies on whatever hardware rounding may take effect; and,
  • certain floating point hardware exceptions (traps) for overflow or underflow may be triggered (though this is not generally the case; note: on x86 machines floating point exceptions may not be triggered but the resulting float may denormalized).

Nowhere in this function is there a check for whether the GMP number (the const mp_limb_t *arr) may be greater than the size of a double and either warn the user (possibly with an ArithException or round the resulting double toward zero.

Compare the problem section in __encodeDouble to the exponent check in the internal GMP function, mpn_get_d, that avoids overflow by comparing the relative magnitude of the GMP number with the maximum magnitude of the double:

   /* note: exp is a long int, given as an argument to min_get_d as 0L --PDT */
  /* Adjust exp to a radix point just above {ptr,size}, guarding against
     overflow.	After this exp can of course be reduced to anywhere within
     the {ptr,size} region without underflow.  */
  if (UNLIKELY ((unsigned long) (GMP_NUMB_BITS * size)
		> (unsigned long) (LONG_MAX - exp)))
    {
      if (_GMP_IEEE_FLOATS)
	goto ieee_infinity;

      /* generic */
      exp = LONG_MAX;
    }
  else
    {
      exp += GMP_NUMB_BITS * size;
    }

(If you want to see the full mpn_get_d function, it is in the file [toplevel gmp source]/mpn/generic/get_d.c .)

There is no check in the Haskell code using __encodeFloat or __encodeDouble, in libraries/base/GHC/Float.lhs. For example, the encodeFloat Haskell function under class RealFloat uses __encodeDouble directly:

encodeFloat (J# s# d#) e = encodeDouble# s# d# e

This is not the case with the safer fromRat function which ensures the Integer falls in the range of the mantissa (see the source code).

A replacement library for GMP might use the GMP strategies of including a special bitwise conversion (with appropriate masks) and a hardware-based version. An unconventional solution might perform the rounding manually (but with relatively portable predictability) using interval arithmetic.

As for printing GMP numbers, libraries/base/GHC/Num.lhs defines a special internal function jtos to handle showsPrec and showList for Integer as an instance of the class Show. The jtos function uses the primitive GMP-based function quotRemInteger and performs many conversions from Integer to Int. This is not as efficient as the internal GMP functions, especially for large numbers, because each conversion allocates extra storage for GMP as noted in Replacing GMP -- Optimisation Opportunities.

Attachments (1)

Download all attachments as: .zip