What are the definitions for >>= and return for the IO monad?

Question

After seeing how the List and Maybe monads are defined, I naturally became curious about how the operations >>= and return are defined for the IO monad.

ehird · Accepted Answer · 2012-02-11 23:43:29Z

There is no specific implementation for IO; it's an abstract type, with the exact implementation left undefined by the Haskell Report. Indeed, there's nothing stopping an implementation implementing IO and its Monad instance as compiler primitives, with no Haskell implementation at all.

Basically, Monad is used as an interface to IO, which cannot itself be implemented in pure Haskell. That's probably all you need to know at this stage, and diving into implementation details is likely to just confuse, rather than give insight.

That said, if you look at GHC's source code, you'll find that it represents IO a as a function looking like State# RealWorld -> (# State# RealWorld, a #) (using an unboxed tuple as the return type), but this is misleading; it's an implementation detail, and these State# RealWorld values do not actually exist at runtime. IO is not a state monad,¹ in theory or in practice.

Instead, GHC uses impure primitives to implement these IO operations; the State# RealWorld "values" are only to stop the compiler reordering statements by introducing data dependencies from one statement to the next.

But if you really want to see GHC's implementation of return and (>>=), here they are:

returnIO :: a -> IO a returnIO x = IO $ \ s -> (# s, x #) bindIO :: IO a -> (a -> IO b) -> IO b bindIO (IO m) k = IO $ \ s -> case m s of (# new_s, a #) -> unIO (k a) new_s

where unIO simply unwraps the function from inside the IO constructor.

It's important to note that IO a represents a description of an impure computation that could be run to produce a value of type a. The fact that there's a way to get values out of GHC's internal representation of IO doesn't mean that this holds in general, or that you can do such a thing for all monads. It's purely an implementation detail on the part of GHC.

¹ The state monad is a monad used for accessing and mutating a state across a series of computations; it's represented as s -> (a, s) (where s is the type of state), which looks very similar to the type GHC uses for IO, thus the confusion.

+1 for the great explanation for why State# is used to preserve statement order!
I disagree that IO is not a State Monad. It is only that the State that it carried around (commonly called RealWorld) is completely abstract. This fact no more disqualifies it as a true State Monad then would the abstract nature of the s-thread in the ST Monad.
@JohnF.Miller: The state monad model of IO can model sequential computation (even though IO is not implemented in that way in practice), but it cannot reasonably represent concurrency. Anyway, it's definitely more confusing than helpful, given that such a model of IO is much more restricted than a state monad (or you'd be able to do things like putting the state of the world back after performing some side-effects, reversing them).
The state-of-the-world model for IO is used explicitly in Mercury, even when concurrency is involved, and it works fine. You just have to view the IO actions as mapping the world before the action has happened to one in which the action has happened along with other effects. But this is necessary anyway in sequential computation, because other concurrent effects are happening all the time due to users, other programs, networks, hardware failures, etc, anyway.
Not quite. It works due to linear types and the fact that there is no way to create a value of the io type (other than receiving it ultimately from main). These points correspond exactly with the inabilities in Haskell's to examine the monadic context of an IO action, or to actually execute an IO action (other than the implicit execution of the main IO action). But I think we're mostly on the same page; I just don't agree that it isn't valid to interpret Haskell's IO monad as a state monad operating on "the world".

Jens Petersen · Accepted Answer · 2021-11-03 07:06:15Z

You will be disappointed, but the >>= in IO monad isn't that interesting. To quote the GHC source:

{- | A value of type @'IO' a@ is a computation which, when performed, does some I\/O before returning a value of type @a@. There is really only one way to \"perform\" an I\/O action: bind it to @Main.main@ in your program. When your program is run, the I\/O will be performed. It isn't possible to perform I\/O from an arbitrary function, unless that function is itself in the 'IO' monad and called at some point, directly or indirectly, from @Main.main@. 'IO' is a monad, so 'IO' actions can be combined using either the do-notation or the '>>' and '>>=' operations from the 'Monad' class. -} newtype IO a = IO (State# RealWorld -> (# State# RealWorld, a #))

That means IO monad is declared as the instance of ~~State~~ State# monad, ~~and its >>= is defined there (and its implementation is fairly easy to guess)~~.

See IO inside article on Haskell wiki for more details about the IO monad. It is also helpful to look at the Haskell docs, where every position has small "Source" link on the right.

Update: And there goes another disappointment, which is my answer, because I didn't notice the '#' in State#. However IO behaves like State monad carrying abstract RealWorld state

As @ehird wrote State# is compiler's internal and >>= for the IO monad is defined in GHC.Base module:

instance Monad IO where {-# INLINE return #-} {-# INLINE (>>) #-} {-# INLINE (>>=) #-} m >> k = m >>= \ _ -> k return = returnIO (>>=) = bindIO fail s = failIO s returnIO :: a -> IO a returnIO x = IO $ \ s -> (# s, x #) bindIO :: IO a -> (a -> IO b) -> IO b bindIO (IO m) k = IO $ \ s -> case m s of (# new_s, a #) -> unIO (k a) new_s

This isn't really true, though. That's State# not State, and as discussed elsewhere, the real magic going on in the IO monad is not captured by its operations on the state token.
GHC.IOBase has since been deleted in this commit. The mentioned definitions now seem to be in GHC.Base.

Clark Gaebel · Accepted Answer · 2012-02-12 01:34:52Z

1

They don't do anything special, and are just there for sequencing actions. It would help if you think of them with different names:

>>= becomes "and then, using the result of the previous action,"

>> becomes "and then,"

return becomes "do nothing, but the result of doing nothing is"

This turns this function:

main :: IO () main = putStr "hello" >> return " world" >>= putStrLn

becomes:

main :: IO () main = putStr "hello" and then, do nothing, but the result of doing nothing is " world" and then, using the result of the previous action, putStrLn

In the end, there's nothing magical about IO. It's exactly as magical as a semicolon is in C.

edited Feb 12, 2012 at 1:34

answered Feb 11, 2012 at 22:45

Clark Gaebel

18.3k21 gold badges70 silver badges94 bronze badges

6 Comments

Daniel Wagner Over a year ago

Sorry, but I downvoted. When something seems magical, merely saying it isn't magical isn't enough -- you have to pull aside the curtain and show how the magic trick worked.

Clark Gaebel Over a year ago

There's nothing magical about IO. It's exactly as magical as a semicolon is in C.

Daniel Wagner Over a year ago

But that's not at all true. Semicolons are a special case of the base syntax of C. The (>>=) and (>>) operators and the function return have no such special status -- they're defined in a library! The fact that you can, at least at first glance, define an impure fragment of the language as a library in an otherwise masochistically pure language surely looks a bit mysterious if you don't know what's going on.

augustss Over a year ago

IO is more magical than semicolon in C. If it were like semicolon you would not be able to, e.g., have IO actions in a list without executing them.

augustss Over a year ago

@leftroundabout But in switching from statements to functions you added the magic that makes IO more magical than semicolons. I'm not arguing that Haskell is more magic than C, just that IO has more to it than semicolon. (Even though Haskell has a little more magic than C; you can't define (>>=) for your function pointer representation. Well, at least not one that can be used more than a bounded number of times.)

|

Collectives™ on Stack Overflow

What are the definitions for >>= and return for the IO monad?

3 Answers 3

11 Comments

2 Comments

6 Comments

Linked

Hot Network Questions

Collectives™ on Stack Overflow

3 Answers 3

11 Comments

2 Comments

6 Comments

Linked

Related