The False Hope of Managing Effects with Tagless-Final in Scala

Tagless-final is a technique originally used to embed domain-specific languages into a host language, without the use of Generalized Algebraic Data Types.

In the Haskell community, tagless-final still refers to a way of creating polymorphic programs in a custom DSL that are interpreted by instantiating them to a concrete data type. In the Scala community, however, tagless-final is used almost exclusively for monadic, effectful DSLs. Usage of the term in Scala is closest to what Haskeller’s mean by MTL-style, but without the algebraic laws that govern MTL type classes.

In Scala, tagless-final has become almost synonymous with types of kind * -> *, leading to the infamous F[_] higher-kinded type parameter that is so pervasively associated with the phrase tagless-final.

In this post, I will argue that the many claims made about tagless-final in Scala are not entirely true, and that the actual benefits of tagless-final come mostly from discipline, not from so-called effect polymorphism.

After this detailed analysis, I will conclude the post by providing a list of concrete recommendations for developers who are building functional Scala libraries and applications.

Tagless-Final 101

In Scala, tagless-final involves creating type classes, which describe capabilities of a generic effect F[_].

Note: There is an alternate, and (I’d argue) superior encoding of tagless-final that involves not type classes, but effect-polymorphic interfaces, which are passed as ordinary parameters; but this alternate encoding doesn’t substantially change my arguments, so I won’t discuss it here.

For example, we can create the following type class to describe console-related capabilities of some effect F[_]:

trait Console[F[_]] {
  def putStrLn(line: String): F[Unit] 
  val getStrLn: F[String]
}

Or, we could create the following type class to describe persistence capabilities for User objects:

trait UserRepository[F[_]] {
  def getUserById(id: UserID): F[User]

  def getUserProfile(user: User): F[UserProfile] 

  def updateUserProfile(user: User, profile: UserProfile): F[Unit]
}

These type classes allow us to create methods that are polymorphic in the effect type F[_], but which have access to required capabilities. For example, we could describe a program that uses the Console interface as follows:

def consoleProgram[F[_]: Console]: F[Unit] = 
  implicitly[Console[F]].putStrLn("Hello World!")

Combined with type classes like Monad (which allows chaining effects), we can build entire programs using the tagless-final approach:

def consoleProgram[F[_]: Console: Monad]: F[String] = {
  val console = implicitly[Console[F]]

  import console._

  for {
    _     <- putStrLn("What is your name?")
    name  <- getStrLn
    _     <- putStrLn(s"Hello, $name, good to meet you!")
  } yield name
}

Because such programs are polymorphic in the effect type F[_], we can instantiate these polymorphic programs to any concrete effect type that provides whatever they require.

For example, if we are using ZIO Task (a type alias for ZIO[Any, Throwable, A]), we can instantiate our program to this concrete effect type with the following code snippet:

val taskProgram: Task[String] = consoleProgram[Task]

Typically, the instantiation of a polymorphic tagless-final value to a concrete effect type is deferred as long as possible, preferrably to the entry points of the application or test suite.

With this introduction, let’s talk about the reasons you might want to use tagless-final…the pitch for the tagless-final technique, if you will.

The Pitch for Tagless-Final

Tagless-final has a seductive pitch that appeals to every aspiring functional programmer.

In the functional programming community, we are taught (with good reason) that monomorphic code can’t be reused much, and leads to more bugs.

We are taught that generic code not only enables reuse, but it pushes more information into the types, where the compiler can help us verify and maintain correctness.

We are taught the principle of least power, which tells us that our functions should require as little as necessary to do their job.

I have helped develop, motivate, and teach these and other principles in my Functional Scala workshops, helping train new generations of Scala developers in the functional way of thinking and developing software.

In this context, functional programmers are primed for the tagless-final pitch; I know this, because I have given the tagless-final pitch, and even helped craft its modern day incarnation.

In one video, I unintentionally convinced several companies to adopt tagless-final, despite an explicit disclaimer stating the techniques would be overkill for many applications!

In the next few sections, I’m going to give you this pitch, and try to convince you that tagless-final is the best thing ever. Moreover, I’m going to use only arguments that have an element of truth.

Ready? Here we go!

1. Effect Type Indirection

As of this writing, there are several mainstream effect types, including ZIO, Monix, and Cats IO, all of which ship with Cats Effect instances, and which can be used more or less interchangeably in libraries like FS2, Doobie, and http4s.

Tagless-final lets you insulate your code from the decision of which effect type to use. Rather than pick one of these concrete implementations, using tagless-final lets you write effect type-agnostic code, which can be instantiated to any concrete effect type that provides Cats Effect instances.

For example, our preceding console program can just as easily be instantiated to Cats IO:

val ioProgram = consoleProgram[cats.effect.IO]

With tagless-final, you can defer the decision of which effect type to use indefinitely (or at least, to the edge of your program), isolating your application from changes in an evolving ecosystem.

Tagless-final lets you future-proof your code!

2. Effect Testability

Tagless-final, because it provides a strong layer of indirection between your application, and the concrete effect type that models effects, enables your code to be fully testable.

In the preceding console implementation, it is easy to define a test instance of the Console type class:

final case class ConsoleData(input: List[String], output: List[String])

final case class ConsoleTest[A](run: ConsoleData => (ConsoleData, A))
object ConsoleTest {
  implicit def ConsoleConsoleTest: Console[ConsoleTest] = new Console[ConsoleTest] { 
    def putStrLn(line: String): ConsoleTest[Unit] = 
      ConsoleTest(data => (data.copy(output = line :: data.output), ()))

    val getStrLn: ConsoleTest[String] =
      ConsoleTest(data => (data.copy(input = data.input.drop(1)), data.input.head))
  }
}

Now, assuming we define an appropriate Monad instance for our data type (which is easy to do!), we can instantiate our polymorphic consoleProgram to the new type:

val testProgram = consoleProgram[ConsoleTest]

Tada! We can now unit test our console program with fast, deterministic unit tests, thereby reaping the full testability benefits of pure functional programming.

3. Effect Parametric Reasoning

Parametric reasoning in statically-typed functional programming circles refers to the ability for us to reason generically about the implementation of a polymorphic function merely by looking at its type.

For example, there is one possible implementation of the following function:

def identity[A](a: A): A = ???

(Assuming no reflection, exceptions, or use of null.)

Parametric reasoning allows us to save time when we are studying code. We can look at the types of a function, and even if we don’t know the exact implementation, we can place bounds on what the function can do.

Parametric reasoning lets us more quickly and more reliably understand code bases, which is critical for safe maintenance of those code bases in response to new and changing business requirements.

Moreover, parametric reasoning can reduce the need for unit testing: whatever is guaranteed by the type, does not need to be tested by unit tests. Types prove universal properties across all values, so they are strictly more powerful than tests, which prove existential properties across a few values.

Since tagless-final is an example of (higher-kinded) parametric polymorphism, we can leverrage parametric polymorphism for effect types too.

For example, take the following code snippet:

def consoleProgram[F[_]: Console: Applicative]: F[String] = ???

Although we don’t know what the implementation of this function is without looking, we know that it requires F[_] to provide both Console and Applicative instances.

Because F[_] is only Applicative and not Monad, we know that although consoleProgram can have a sequential chain of console effects, no subsequent effect can depend on the runtime value of a predecessor effect (that capability would require bind from Monad).

We would not be surprised if we saw the implementation were as follows:

def consoleProgram[F[_]: Console: Applicative]: F[String] = {
  val console = implicitly[Console[F]]

  import console._

  putStrLn("What is your name?") *> getStrLn
}

By constraining the capabilities of the data type F[_], tagless-final lets us reason about our effectful programs parametrically, which lets us spend less time studying code, and less time writing unit tests.

The Closer

As we have seen, tagless-final is an incredible asset to functional Scala programmers everwhere.

Not only does it enable abstraction over effects, so programmers can change their mind about which effect type to use, but the technique gives us full testability of effectful programs, and helps us reason about our effectful programs in new ways, which can reduce the need for studying code and cut down on unit tests.

The Fine Print

Sold yet on tagless-final? I am!!! Well, somewhat, anyway.

There’s an element of truth in every argument that I’ve made. Although as you might suspect, a more nuanced, sophisticated look reveals a less positive picture of tagless-final.

Let’s take a look at all the fine print in the next few sections.

1. Premature Indirection

It is absolutely true that adding a layer of indirection over an effect type can reduce the cost of switching to a different effect type, assuming similar underlying semantics.

It is also true that adding a layer of indirection over Spark, Akka https, Slick, or a database-specific dialect of SQL, might reduce the cost of switching to different technologies.

In my experience, however, the attempt to proactively add layers of indirection without a clear business mandate to do so, simply to mitigate the possible cost of future change, is an example of premature indirection.

Premature indirection rarely pays for itself.

In most cases, overall application costs would be substantially lower picking a specific technology, and then, if needs change, simply refactoring the code to a new technology.

The cost of refactoring from one effect type to another is not related to the cost of changing types from IO[_] to Task[_], or swapping one set of methods for another. Rather, it is related to the semantic differences between operations on these effect types. Yet a layer of indirection helps us only when semantic differences are relatively small. If they are small, then the cost of refactoring is relatively low.

A refactoring from one effect type to another only needs to happen once, and only if actually necessary (which it might not be). But the cost of coding to a layer of indirection has to be paid indefinitely, and it must be paid regardless of whether or not the indirection will ever be used.

Beyond the cost of coding to an indirection layer that may never be used, there are substantial opportunity costs to premature indirection. In the case of ZIO, for example, the core effect type has hundreds of additional operations that are not available on a polymorphic F[_].

These methods, like many additional methods on Monix Task, are discoverable by IDEs; their types guide users to correct solutions; they have great inline Scaladoc; error messages are concrete and actionable; and type-inference is nearly flawless.

If you commit to not committing, you’re stuck with the weakest F[_], which means much of the power of an effect system remains inaccessible. The frustration of knowing a method is right there, but just out of reach, has prompted many to introduce custom type classes designed to access specific features of the underlying effect type.

The opportunity cost is even greater for ZIO than Monix, because ZIO features polymorphic reader and error effects, which provide new operations and allow parametric reasoning about dependencies and error handling, and which are currently unsupported in Cats Effect (the leading library for effect indirection).

In my opinion, only library authors have a compelling argument for effect type indirection. In order to maximize market share (which is critical for the adoption, retention, and growth of open source libraries), they need to support all major effect types, or risk losing market share to the libraries that do.

While this is a compelling argument for open source libraries, it is completely inapplicable to the closed source applications that make up the majority of Scala software development.

2. Untestable Effects

Tagless-final provides a path to testability, but tagless-final programs are not inherently testable. In fact, they are testable only to the degree their tagless-final type classes are testable.

For example, while we can create a testable version of the Console type class, many others, including popular type classes in the tagless-final community, are inherently untestable.

The majority of applications written in the tagless-final style make heavy use of a type class in Cats Effect called Sync, or its more powerful versions, including Async, LiftIO, Concurrent, Effect and ConcurrentEffect.

These type classes are designed to capture side-effects—for example, random number generation, API calls, and database queries. Any code that utilizes these type classes, or others like them, cannot be unit tested even in theory, because it interacts with partial, non-deterministic, and side-effecting code.

While such code can be tested with integration and system tests, any code at all can be tested with integration and system tests, including the worst possible procedural code!

Ultimately, the testability of tagless-final programs requires they code to an interface, not an implementation. Yet, if applications follow this principle, they can be tested even without tagless-final!

Here is a monomorphic version of the Console type class, for example:

trait Console {
  def putStrLn(line: String): Task[Unit] 
  val getStrLn: Task[String]
}

If your program uses this interface to perform console input/output, then it can be tested, even with a monomorphic effect type. The key insight is that your program must be written to an interface, rather than a concrete implementation. Unfortunately, numerous widely-used tagless-final interfaces (like Sync, Async, LiftIO, Concurrent, Effect, and ConcurrentEffect) encourage you to code to an implementation.

Using tagless-final doesn’t provide any inherent benefits to testability. The testability of your application is completely orthogonal to its use of tagless-final, and comes down to whether or not you follow best practices—which you can do with or without tagless-final.

3. No Effect Polymorphic Reasoning

The most insidious claim that tagless-final makes is that it provides effect polymorphic reasoning.

According to this claim—which is not entirely without merit—we can look at a polymorphic function signature, and know the effects it performs merely by examining constraints.

Previously, we looked at the following example:

def consoleProgram[F[_]: Console: Applicative]: F[String] = ???

I argued we could know something about what this function does by observing its use of Applicative and Console. In theory, these constraints provides us the ability to partially understand the behavior of the function without examining the implementation.

Unfortunately, this argument rests on a premise that is false. Namely, it rests on the premise that implicit parameters somehow constrain the side-effects executed or modeled by the function.

Scala does not track side-effects, no matter how much we wish otherwise. We can perform console effects anywhere, even without the provided Console[F], as shown in the following snippet:

def consoleProgram[F[_]: Applicative]: F[String] = {
  println("What is your name?")
  val name = scala.io.StdIn.readLine()
  Applicative[F].pure(name)
}

Not only can we perform any effects anywhere inside the body of the method without even so much as a compiler warning, but we can embed these effects into any Applicative functor, even one with a strict definition of pure / point:

def consoleProgram[F[_]: Applicative]: F[String] =
  Applicative[F].pure(()).map { _ =>
    val _ = println("What is your name?")
    scala.io.StdIn.readLine()
  }

This is legal, type-safe Scala code, and variations of this code appear pervasively in real world Scala. Not even the most aggressive IntelliJ IDEA lint settings will report any problems with this code snippet (except perhaps a warning about the use of higher-kinded types).

In tagless-final, “constraints”—parameter values like Applicative[F] or Console[F]—do not actually constrain, they unconstrain, giving us more ways of constructing values. And in Scala, because side-effects are already totally unconstrained, adding more values to parameter lists doesn’t change anything.

Stated more forcefully and for Scala code, effect parametric reasoning is a lie.

Now, as highly-skilled and highly-disciplined functional programmers, we can agree amongst ourselves to reject merging such code into our projects. We can inspect every line of code in our application to ensure that side-effects are captured only in “appropriate” places, where we all agree on some definition of appropriate.

A social contract, powered by good-will, skill, discipline, and indoctrination of new hires, can be quite useful. Nearly all best practices are social contracts, and they undoubtedly help us write better code. But social contracts should not be confused with compile-time constraints.

If we assume a working social contract to restrict side-effects in Scala, then we can assume restrictions on the effects in the preceding definition of consoleProgram, but these restrictions do not come from effect polymorphism (which does not exist in a language without effect tracking!). Rather, the restrictions come from diligence and discipline in enforcing the social contract—the line-by-line review of every new pull request.

If we are going to rely on a social contract to give us reasoning benefits, however, then we should consider other social contracts, such as always code to an interface, not an implementation.

Other social contracts can give us the exact same “reasoning benefits”, but with potentially better ergonomics. Stated differently, “reasoning benefits” is not a reason to prefer tagless-final over any other approach, because those benefits derive only from discipline (not from effect polymorphism), and discipline works for other approaches too.

These are hard limits on effect parametric polymorphism in Scala, which—short of inventing a compiler plug-in that overlays a new, effect-tracked language onto Scala—cannot be circumvented.

4. Sync Bloat

Even if we ignore the fact that effect parametric polymorphism doesn’t exist in Scala (reasoning benefits come from discipline, not from tagless-final), we have a major problem that I term sync bloat.

In Scala, the Cats Effect type class hierarchy provides many type classes that are explicitly designed to capture side-effecting code. These type classes include Sync, Async, LiftIO, Concurrent, Effect, and ConcurrentEffect.

Methods that require one of these type classes can literally do anything they want, without constraints, even assuming a working social contract. These methods nullify the power of discipline, depriving us of any benefits to reasoning and testability, resulting in opaque blobs of side-effecting, untestable procedural code.

Nearly all tagless-final code (including some of the best open source functional Scala I know of) makes liberal use of these type classes, freely embedding side-effects in numerous methods sprawled across the code base.

In a perfect world, perhaps programmers would create hundreds or thousands of fine-grained, testable type classes to represent separate concerns. But in the real world, the vast majority of programmers using tagless-final (even high-skilled, expert-level functional programmers!) are not doing this. Instead, they’re requiring type classes like Sync that encourage embedding arbitrary side-effects everywhere.

The pervasive phenomenon of sync bloat means that even if we have a working social contract, powered by discipline and diligence, we still aren’t gaining any benefits of effect parametricity.

5. Fake Abstraction

As I teach in Functional Scala, abstractions are sets of operations that satisfy algebraic laws, encoded using type classes. Abstractions allow us to describe common structure across a range of distinct data types.

Abstraction is the means by which we can write principled polymorphic code.

Without lawful operations, there is no abstraction, only layers of indirection masquerading as abstraction.

As practiced in Scala, tagless-final encourages a form of fake abstraction. To see this, let’s take a closer look at the Console[F] type class defined previously:

trait Console[F[_]] {
  def putStrLn(line: String): F[Unit] 
  val getStrLn: F[String]
}

This trait has two operations, called putStrLn and getStrLn. We can provide implementations of these two different operations for many different data types.

Unfortunately, these operations satisfy no algebraic laws—none whatsoever! This means when we are writing polymorphic code, we have no way to reason generically about putStrLn and getStrLn.

For all we know, these operations could be launching threads, creating or deleting files, running a large number of individual side-effects in sequence, and so on.

This contrasts quite dramatically with type classes like Ord, which we can use to build a generic sorting algorithm that will always be correct if the algebraic laws of Ord are satisfied for the element type (this is principled functional programming).

The implications of this are profound. Consider the following type signature:

def consoleProgram[F[_]: Console: Applicative]: F[String] = ???

Because Console[F] is lawless, we cannot reason generically about the side-effects modeled by the returned F[Unit]. These side-effects are unconstrained precisely because there are no algebraic laws that govern the operations provided to us by Console[F].

Two different implementations of Console[F] could do totally different things, without breaking any laws, so we cannot actually reason generically about the correctness of code using Console[F]. Our code may be correct for some Console[F], and incorrect for other Console[F].

Keep in mind that Console[F] is just a toy example. A realistic tagless-final type class would be far larger and more complex, containing numerous operations, which are ad hoc, unlawful, and impossible to reason about generically. If we write code using this type class, it will be highly coupled to unspecified implementation details.

Real generic reasoning applies only to real abstractions, like Monoid, Monad, and type classes from functional programming. The moment we introduce ad hoc, unspecified, implementation-specific operations like putStrLn or getStrLn, we can no longer reason generically about the behavior of our code in any principled way.

What this means is that even if we grant that Scala has effect parametric polymorphism (which it doesn’t!), and even if we assume that developers won’t use type classes like Sync (which they will!), the ad hoc nature of tagless-final type classes means we don’t actually have useful generic reasoning across these type classes.

If only Applicative[F] is required, we can tell that bind is not used, but we cannot tell how many individual side-effects are chained together to make up a single operation like putStrLn. The limited reasoning we can do is useless (a parlor trick, at best!), precisely because of all the reasoning we cannot do.

Does putStrLn print a line of text to a console? Or does it launch a multithreaded main function with the whole application? Who knows. The types and laws don’t tell you anything.

Adding Console[F] to a type signatue is at best a prayer that whoever gives us a Console[F] will abide by an unspecified contract that has something to do with console input/output and will make our “generic” code work correctly.

Polymorphic code that we can reason about generically is an awesome benefit of statically-typed functional programming. But generic reasoning requires abstractions, which must always have algebraic laws. The moment we create fake abstractions (operations without laws), we aren’t doing principled functional programming anymore.

The Tagless-Final Hit List

Tagless-final in Scala doesn’t entirely live up to the hype, as I’ve argued in this post:

Premature Indirection. For applications, using tagless-final to guard against the possibility of changing effect types is usually overengineering (premature indirection). Your application doesn’t add indirection layers for Akka Streams, Slick, Future, or even the dialect of SQL you’re using—because in most cases, the cost of building and maintaining that layer of indirection is unending. In the case of tagless-final effects, you also deprive yourself of many lawful operations and added type safety.
Untestable Effects. Many popular tagless-final type classes encourage capturing side-effects. Even if we are disciplined and diligent, these type classes destroy our ability to reason about and unit test applications built using them. In the end, testability is not a property of tagless-final code; testability requires you code to an interface, which you can do with or without tagless-final.
No Effect Parametric Polymorphism. Scala doesn’t constrain side-effects, and implicit parameters don’t change this. If you want to constrain side-effects, you need a social contract, enforced by discipline and diligence. In this case, reasoning benefits (such as they are) come only from painstaking review of every line of code, not from effect polymorphism. Yet other approaches that require discipline, like ensuring programmers only code to an interface, can provide similar benefits, but without the drawbacks of tagless-final.
Sync Bloat. In Scala, real world tagless-final code tends not to use custom type classes, but rather, a few type classes that allow the unrestricted capture of side-effects. These Sync code bases do not confer any benefits to reasoning or testability, even if we are disciplined and diligent during code review. They combine the ceremony and boilerplate of tagless-final with the untestable, unreasonable nature of the worst procedural code.
Fake Abstraction. Reasoning about generic code requires abstractions, which come equipped with algebraic laws. Algebraic laws precisely define common structure across different data types, and they let us reason generically about the correctness of polymorphic code. Yet most tagless-final type classes do not have any laws. Any code that uses “fake abstractions” is not actually generic, but instead, is closely wedded to unspecified implementation details.

Beyond just not living up to the hype, tagless-final has a number of serious drawbacks:

Tagless-final has significant pedagogical costs, because of the huge number of concepts it requires a team to master (parametric polymorphism, higher-kinded types, higher-kinded parametric polymorphism, type classes, higher-kinded type classes, the functor hierarchy, etc.).
Tagless-final has significant institutional costs, because of the level of ceremony and boilerplate involved (type classes, type class instances, instance summoners, syntax extensions, higher-kinded implicit parameter lists, non-inferrable types, compiler plug-ins, etc.).

I’ve talked about these drawbacks at length in the past, and I encourage readers to investigate for themselves the drawbacks of tagless-final in Scala.

Objections

Some functional programmers, when presented with these drawbacks, immediately counter with the objection that other approaches (including the reader monad) can’t constrain side-effects in Scala, either.

This is true, but also entirely beside the point.

There are zero approaches to statically constraining effects in Scala, because Scala cannot statically constrain effects. This means that when comparing two different approaches to managing effects in Scala, there is no dimension for “constraining effects”.

You can combine tagless-final with manual inspection of every line of code in an application, to ensure it satisfies the social contract that side-effects will only be executed and captured in “approved” places. This combination, which is powered by discipline (not effect polymorphism), provides both testability and reasoning benefits.

Similarly, you can combine the reader monad with manual inspection of every line of code in an application, to ensure it satisfies a social contract that logic will be written to interfaces, not implementations. As with tagless final, this combination is powered by discipline, and provides both testability and reasoning benefits.

Both tagless-final and the reader monad (and many other approaches) can indeed provide “guarantees” about effects, but it’s not really the techniques that are providing the guarantees, but the programmers who are manually reviewing and merging every line of code. These “guarantees” come from discipline, not from the Scala compiler.

Take the following snippet of tagless-final:

def console[F[_]: Monad: Console]: F[Unit] = ???

The only way we “know” this function does not interact with databases or perform random number generation or do anything else is if we or our colleagues have inspected every line of code, and manually certified that it doesn’t break our social contract about where side-effects can be run and modeled.

The compiler doesn’t provide any assistance with this, and this “knowledge” is not related to effect polymorphism.

Similarly, take the following snippet of monomorphic code:

def console(c: Console): Task[Unit] = ???

The only way we “know” this function does not interact with the database or perform random number generation or do anything else is if we or our colleagues have inspected every line of code, and manually certified that it doesn’t break our social contract about coding to interfaces, and not to implementations.

As with taglesss-final, the compiler doesn’t provide any assistance. Both these approaches and others to reasoning about side-effects come down to social contracts, powered by discipline and vigilence.

Another vague objection I have heard is that tagless-final is somehow more principled than other approaches, or that it somehow lends itself better to designing composable functional APIs.

In reality, principled, composable, functional code has everything to do with whether the operations of your domain satisfy algebraic laws. As practiced in Scala, tagless-final has no relation to principled functional programming.

The use of higher-kinded parametric polymorphism may provide an illusion of “rigor”, but it’s just a thin veneer on what is typically unprincipled imperative code. All the effect type polymorphism in the world can’t change this.

Recommendations

In light of this analysis of tagless-final, I have some concrete recommendations for different situations:

Stay the Course. If you’re in a small, stable, high-skilled team that’s already using and benefiting from tagless-final due to a working social contract, then keep using tagless-final. Long-time functional programmers may “forget” how to program procedurally and instinctively confine themselves to a functional subset of Scala, which makes enforcement of the contract easier. Consider abandoning tagless-final (or building a compiler plug-in) if the team starts scaling.
Build a Library. If you’re building an open source library that doesn’t take advantage of effect-specific features, then use an existing layer of indirection, such as Cats Effect. Or if you want to keep dependencies small, create a tiny layer of indirection for just the features you need. This layer may not help you reason about or test code, but it will help you support the full functional Scala market, which will improve adoption of your library.
Ditch Tagless-Final. In all other cases, I recommend picking a concrete effect type (ZIO, Cats IO, or Monix). If you decide to switch later, you’ll pay a one-time cost for the (straightforward) refactor. Encourage coding to interfaces, not implementations. This social contract doesn’t scale either, but at least many Java developers are already indoctrinated in the practice. This will give you testability, it can be done incrementally, and it can give you the same (discipline-powered) reasoning benefits as tagless-final, if employed to the same extent.
Minimize Effectful Code. Consider minimizing effectful code. Look for real abstractions in your domain, which are equipped with algebraic laws that help you reason generically about polymorphic code. Prefer declarative code instead of imperative (monadic) code. Don’t presume your State monad code is any better than its IO equivalent (it’s not). Prefer data types whose operations have denotational semantics.

Summary

Tagless-final has a brilliant sales pitch. It promises to future-proof our code to changes in concrete effect types. It promises us testability. It promises us the ability to reason about effects using parametric polymorphism.

Unfortunately, the reality of tagless-final doesn’t live up to the hype:

Tagless-final does insulate us from an effect type, but that’s a maintenance burden and deprives us of useful principled operations and type-safety.
Tagless-final doesn’t provide us any testability, per se, and many common type classes prevent testability; it’s only coding to an interface that provides us with testability, which can be done with or without tagless-final.
Tagless-final doesn’t constrain effects, since Scala has no way to restrict side-effects; type signatures alone cannot tell us which side-effects are executed or modeled by a method.

Beyond these drawbacks, real world tagless-final is littered with sync bloat, which can’t help us with unit testing or reasoning even if we are disciplined and diligent about restricting side-effects.

Further, since most tagless-final type classes are completely lawless, we can’t reason generically about code that uses them. True generic reasoning requires abstractions, defined by lawful sets of operations, and tagless-final doesn’t give us abstractions, only collections of ad hoc operations with unspecified semantics.

Tagless-final, far from being a pancea to managing functional effects, imposes great pedagogical and institutional costs. Many claim the technique renders Scala code bases impenetrable and unmaintainable. While an exaggeration, there is no question that the learning curve for tagless-final is steep, and the ergonomics of the technique are poor.

Ultimately, in my opinion, the benefits of tagless-final do not pay for the costs—at least not in most cases, and not with the current Scala compiler and tooling.

Small, stable teams that are already using tagless-final with a working social contract should probably keep using tagless-final, at least if they have found the benefits to outweigh the costs (and some teams have).

Developers of open source libraries can surely benefit from a layer of indirection around effect types, because even though indirection may not help with reasoning or testability, it can increase addressable market share.

Finally, other teams should probably avoid using tagless-final for managing effects, and embrace the age-old best practice of coding to an interface. While still just a social contract that relies on discipline, the practice is widely known, doesn’t require any fancy training, and can be used with more ergonomic approaches to testability and reasoning, including traditional dependency injection, module-oriented programming, and the reader monad.

John A De Goes