Allow the user to prevent floating and CSE

mentioned in issue #8457

changed weight to 5

added Tfeature request Trac import labels

changed the description

Trac metadata

Trac field	Value
CC	MikolajKonarski, edsko, kopernikus → MikolajKonarski, edsko

That all looks possible. Since nofloat does several things, it may not be long before people start asking for variants that do some combination of its properties. But I guess we can jump that bridge if we come to it.

It would be useful to give some compelling use-cases.

Can I suggest a closely related idea, and also related to #9520 (closed)

data Pipe i o r = Yield o {-# NOUPDATE #-} (Pipe i o r)

This says we'll never do thunk updates on that field in that constructor. So similar idea (I believe) to oneShot lambdas.

Indeed we might need both no update on fields and oneShot, I'm not sure, e.g.:

data Pipe i o r = Yield o {-# NOUPDATE #-} (Pipe i o r)
                | Await   {-# NOUPDATE #-} (Either r i -> Pipe i o r)

-- smart constructor:
await f = Await (GHC.Magic.oneShot f)

What's all this for? For avoiding treating these control structures as data structures (which is what #9520 (closed) is all about).

Right, so a lot of the thinking that led to this ticket came from trying to understand memory leaks in conduit code. See my recent blog post http://www.well-typed.com/blog/2016/09/sharing-conduit/ where these issues are described in great detail; this should also serve, I hope, as one "compelling use case".

That said, I like the idea of a "noupdate" much better than a "nofloat". It would seem to me that its semantics would be easier to specify; and if it means I don't have to think so hard about what exactly the optimizer is doing to my code in order to understand why I do or do not have a memory leak, that would very welcome.

I really like @duncan 's suggestion of having a type annotation on a type; though we might also want some adhoc way of saying "make this thunk not-updateable". An easyish experiment perhaps might be to declare a magic datatype

data DontUpdate a = DontUpdate a

with the property that any code that looks at the thunk in the payload of DontUpdate doesn't cause that thunk to be updated. Then in @duncan 's example we could define

data Pipe i o r = Yield o (DontUpdate (Pipe i o r))

That said, I'm not sure exactly what DontUpdate should do for the lambda; but this is a question about @duncan's proposal too. I think what we want to happen is that the thunks in the function closure never get updated (this, in a nutshell, is what is causing memory leaks in conduit code; see the blog post); but that's already more magical than just saying "don't update this thunk".

Trac metadata

Trac field	Value
Related	- → #8457, #9520 (closed)

I think that "noupdate" would require some careful thought. What if I say

f x = if ... then Yield blah x
             else ...

Then the "noupdate" second field of Yield is just the parameter to f. Does the caller have to know not to build an updatable thunk. And why is updating so bad?

(Confession: I have not yet read Edsko's post. But I it should be possible to give a crisp explanation of what any language feature does in a standalone way.)

Right, this is an initial idea and hasn't been fleshed out. Thanks for the probing example :-)

So the intention is that it's a purely local thing. So in that example, the answer is no, we do not expect a caller far away to have to know anything. The idea is that evaluating "via" the noupdate field should not perform thunk updates, but I appreciate that may not match how thunk construction and update works.

So how about something like this...

Suppose the primitive is not on fields, but on let. This is by analogy with strict let !_ = versus strict constructor fields. The primitive with strictness is at use sites and a convenience for systematic use we can push it to constructor fields, which is defined in terms of constructor wrappers.

So suppose the primitive is let {-# NOUPDATE #-} x = ..., and so then the Yield constructor above could perhaps be defined with a wrapper like

data Pipe i o r = Yield o {-# NOUPDATE #-} (Pipe i o r)

yield o x = let {-# NOUPDATE #-} x' = x in Yield o x'

So in your f x example above then this would do very little (and indeed we'd want it to do precisely nothing different to the usual, by shorting out the extra let indirection). But if things are defined with Yield (expr) or locally ghc decides to float/push things in, then the expression would end up in the let {-# NOUPDATE #-} x' = ... and so there would be an effect.

I'm very glad to see full laziness getting some attention. I've been aware of its deleterious effects for some time and have tried to spread awareness of it:

I have even asked whether it is an optimization worth performing at all, though I conclude that it is:

https://stackoverflow.com/questions/35115172/why-is-full-laziness-a-default-optimization/35115664

The full laziness transformation causes a lot of headaches and something really needs to be done about it. However I do not think this suggestion is the right approach. Why not tweak the transformation so that it only fires in cases that are guaranteed not to lead to memory leaks? That could be as simple as only hoisting bindings of monomorphic non-recursive datatypes. The proposed nofloat keyword is just adding additional complexity over a transformation which itself is introducing too much complexity. I'm very concerned about the idea.

Replying to [ticket:12620#comment:125268 simonpj]:

I think that "noupdate" would require some careful thought. What if I say
f x = if ... then Yield blah x
else ...
Then the "noupdate" second field of Yield is just the parameter to f. Does the caller have to know not to build an updatable thunk.

I guess we would instruct the demand analysis to believe that Yield has strictness signature <L,U><L,1*U> and thus this once-used information will be propagated, at least to the extent possible.

Replying to [ticket:12620#comment:125277 tomjaguarpaw]:

I'm very glad to see full laziness getting some attention (...) I have even asked whether it is an optimization worth performing at all, though I conclude that it is:

https://stackoverflow.com/questions/35115172/why-is-full-laziness-a-default-optimization/35115664

Yup, I cite this in the blog post :)

However I do not think this suggestion is the right approach. (...) The proposed nofloat keyword is just adding additional complexity over a transformation which itself is introducing too much complexity. I'm very concerned about the idea.

I agree that it would be preferable not to "program the optimizer" when writing Haskell code. That's another reason in fact why I prefer noupdate over nofloat, beacuse actually noupdate goes beyond full laziness. Consider this example from the blog post:

retry :: IO a -> IO a
retry io = catch io (\(_ :: SomeException) -> retry io)

main :: IO ()
main = retry $ ni_mapM_ print [1..1000000]

This program has a memory leak, but it's nothing to do with full laziness here. Now admittedly we could turn this into a full laziness issue by giving the argument to retry a dummy unit argument or something like that, so that we write

retry :: (() -> IO a) -> IO a
retry io = catch (io ()) (\(_ :: SomeException) -> retry io)

main :: IO ()
main = retry $ \() -> ni_mapM_ print [1..1000000]

or something like that, but then you would have to do that in every single function that duplicates IO actions (think forever, replicateM_, etc.) Instead, we could mark that list as noupdate and the memory leak would be gone.

Edsko, it seems to me that the problem that you mention here is quite easy to avoid.

main :: IO ()
main = retry $ return () >>= \_ -> ni_mapM_ print [1..1000000]

is sufficient, unless I am very much mistaken. With such a construction the list is allocated afresh for each invocation of the IO action.

Fair enough, that's an easier workaround. But the idea is to have something a little more compositional. For example, in the case of conduits, we probably never want to share a conduit value. So it would be great if we could annotate the conduit constructors with a noupdate annotation, and then users of the conduit library don't have to worry about this problem anymore. After all, in the list example, it's not obvious that

main :: IO ()
main = retry $ runConduit someConduit

has a space leak; even less so when that retry and the runConduit are in different places:

go :: IO ()
go = runConduit someConduit

main :: IO ()
main = retry go

We'd need to have the foresight to write

main :: IO ()
main = retry $ return () >>= \_ -> go

The situation really is very close to strictness; do we want to make sure every single function using a datatype has the right seqs in the right place, or we just put some strictness annotations on the datatype?

It looks like that fix only works for the default -O0. Passing either -O1 or -O2 reintroduces retry's space leak

Edsko, I'm a bit puzzled. For the case of conduits, isn't it enough to hide things behind lambdas in the definition of the Pipe type?

Wren, sure, but Edsko's original claim is that this isn't a full laziness issue. My example brings it back to being a full laziness issue indeed. My contention is that even given Edsko's example it still makes more sense to fix the full laziness transformation than add a magic word.

Allow the user to prevent floating and CSE

Child items ...

Activity

Allow the user to prevent floating and CSE

Relates to

Activity