Isolated synchronous deinit #2

Nickolas_Pohilets · January 14, 2023, 6:56pm

I'm processing feedback from the first iteration:

So, in the updated proposal current behaviour will be preserved by default, and isolation will require explicit opt-in.

For actors, syntax would be isolated deinit {.

For GAIT, there are two options. One can similarly use attribute isolated to opt-in into inheriting global actor from the class. Alternatively one can explicitly repeat global actor declaration:

class MyView1: UIView {
    isolated deinit { ... }
}

class MyView2: UIView {
    @MainActor deinit { ... }
}

I think the first option is more preferable, because it follows DRY principle. So, it will be recommended in the proposal text, any updates to the Swift book should use it, and in case of ambiguity it will be chosen for fix-its.

It is an error to use isolated if there is no global actor isolation to inherit. It is an error to combine isolated attribute with nonisolated or global-actor isolation attribute.

I'm willing to push back on this. I think it is a premature optimization. I see value in having access to the task locals in the deinit. If object is referenced by several child tasks, you cannot know which one will make last release, so you cannot reliably use values set in child tasks. But as long as you are confident that object does not escape scope of the parent task, you can still reliably use values set by the parent task.

If task locals are used to inject logger/error tracker, it is easy to forget to reinject it in the deinit. This will cause errors occurring inside the deinit to be not visible in the dashboards. And if errors are rare and unexpected, both problems (error itself and broken monitoring) can remain undiscovered for a long time.

class ErrorLogger {
    @TaskLocal
    static var instance: ErrorLogger? = nil

    func log(_ error: Error) { ... }
}

@MainActor
class Resource {
    isolated deinit {
        shutdown()
    }
}

func shutdown() {
    do {
        ...
    } catch {
        ErrorLogger.instance?.log(error)
    }
}

And also note that performance cost for copying task locals occurs only when hopping is needed. If last release occurs on the right actor, no copying happens. Which I expect to be pretty common.

To support demand for blocking copying task-locals in isolated deinit, I can suggest to use an attribute:

@MainActor
class Resource {
    let errorLogger: ErrorLogger

    @detached
    isolated deinit {
        EventLogger.$instance.withValue(errorLogger) {
            shutdown()
        }
    }
}

Attribute @detached implies analogy between Task vs Task.detached. Which is a good analogy in regards to task-locals, but Task vs Task.detached also differs in priority inheritance. I don't recall seeing much demand for priority management for isolated deinit, so it might be better to use different attribute name to avoid overpromising.

If implementing blocking task-locals, I can also add public API for that, something like this:

public func withResetTaskLocalValues<R>(
    operation: () async throws -> R,
    file: String = #fileID, line: UInt = #line
) async rethrows -> R

And one could use this function to achieve blocking task-locals deinit as a functional requirement, but not for performance reasons:

// When hopping, task-locals are first copied
isolated deinit {
    // ... and then blocked
    withResetTaskLocalValues {
        ...
    }
}

As an alternative to @detached attribute, it is possible to provide a guaranteed optimization that recognises this pattern and blocks task-locals in the swift_task_deinitOnExecutor.

Async deinit is a big topic, I'm gonna make a separate thread for it.

ktoso · January 21, 2023, 3:02am

Hi Nickolas! Super happy that you've picked this up again!

I agree with this as the recommended spelling for a GAIT's deinit. Basically isolated and nonisolated operate as "isolate to self (or not)" (though we had some discussions about the nuances there with @kavon recently), so it makes sense for an isolated deinit {} to mean the same when defined in a concrete actor or any other type annotated with @GlobalActor

The latter I guess does not really have to be banned... if that would just work? No strong feels on that.

It is an error to use isolated if there is no global actor isolation to inherit. It is an error to combine isolated attribute with nonisolated or global-actor isolation attribute.

Agreed as well, sounds good.

Relying on a task local to be present in deinit has me very concerned. We should perform some benchmarks to see the impact here.

I really would like to be able to recycle tasks for the purpose of such deinit (and similar things). So whatever we come up with here really should not prevent such this optimization in the future. It is not the optimization right now that I'm concerned about.

I had to re-read this a few times to see what you mean... Specifically you mean that if we're going to cause the deinit while executing on the right actor already, then you'd run the deinit in-place. Yeah then by accident we don't have to copy, that's true...

But once we get to async deinits, we'll always have to, because the Task{} we kick off to perform a deinit is unstructured, i.e. we don't await for its completion inside the scope where deinit was "caused", so we always must copy locals if we say they are guarnteed to be seen in deinit. Visualized:

func x() {
  let x = X()
  // Task { await x.deinit() }
  // no await on the deinit means that we must copy the task-locals
}

Perhaps it's worth writing up a matrix of all the cases and expected behaviors so we can see if we can come to an agreement here?

The "reset task locals" function can potentially be useful in any case; that's a nice thing we might want to consider. Implementation wise we can do this very nicely by inserting a tombstone into the chain so lookups simply stop searching once they notice it

John_McCall · February 21, 2023, 9:53pm

@Nickolas_Pohilets, I know you've asked for more feedback from the Language Workgroup on this idea, as well as your ideas on async deinit. Is this the right thread for that, or is there a better one?

Nickolas_Pohilets · February 23, 2023, 12:44pm

If it is about sync deinit - attributes, task local behaviour or other topics, then this thread is a good place.

If it is about async deinit, these two might be a better topic:

John_McCall · February 28, 2023, 2:26am

Sorry for the continued slow turnaround. I know that one of the issues hanging over this is whether to copy task locals. Is there a longer discussion of that somewhere, or is this thread pretty much it?

Part of my concern about copying task locals is that it's extremely brittle to depend on any kind of thread/task context being present when the last release happens. Unless the object is actually non-Sendable, it's very easy for references to it to escape to other contexts. In particular, @MainActor classes are Sendable and by design can be passed around outside the main actor as long as their methods and properties aren't used there. If that happens even once for an object, it's almost guaranteed that there's a potential race where that reference will be the last one released. It's one thing to optimize based on the assumption that that's unlikely, it's another to encourage people to write code that will break if it ever happens.

Nickolas_Pohilets · February 28, 2023, 1:20pm

That's pretty much it.

I expect task locals in deinit to be useful in two scenarios:

[Mostly] When value is set on a root of the tree of tasks. You don't know in which of the leaf tasks last release will happen, but you don't care. Because values you are using are present in the entire tree.
[Occasionaly] For debugging, or other purposes, you want to break the abstraction and actually discover in which task last release happened.

func example() async {
    ErrorHandler.$current.withValue(FirebaseErrorHandler()) {
        let object = WithDenit()
        async let f = foo(object)
        let task = Task {
            bar(object)
        }
        print(await f, await task.value)
    }
}

class WithDeinit {
    @MainActor
    deinit {
        // Ok to use ErrorHandler.current
        // No matter if last release happens in the parent task, async-let
        // or a non-structured task, task-local value will be available
    }
}

Using task-locals, thread-specifics or queue-specifics in the deinit indeed can be brittle, but that's already the case for regular reinit's. I see isolation of the deinit as an orthogonal matter. If we think that this a major problem, then probably we should discuss disabling task locals (and maybe thread-specifics and queue-specifics too) for all deinits. If not - then I think adding isolation to the deinit should not silently introduce another behavioural changes.

ktoso · February 28, 2023, 2:43pm

I think that would be highly unreliable. And in any case, too unreliable to build any real APIs around requiring this. I really would not want to have APIs which would crash on "released on wrong task", that sounds like a nightmare to hunt down.

I'm looking at this as someone designing the distributed tracing API, where very much you'd like to have a span "released in the right context", but since this would be unreliable and one could pass around a span or whatnot, this cannot serve as tool to implement such API, and we still need an explicit defer { span.end() } or withSpan {} APIs instead.

Task locals are a bad tool for this though; Since there is going to be programatic access to backtraces that would be the right way to debug such things, wouldn't it: [Pitch] Swift Backtracing API

And I'll once again bring up the future work topic where we were worried about giving too strong promises potentially preventing future optimizations. Like @kavon mentioned, spinning up tasks for every deinit is expensive, so we could try to cheat and reuse tasks someday but to do this efficiently, we should not guarantee any task local values in those deinitializers IMHO.

I'm not so concerned about a potential difference here to be honest... but it could be considered: adding a "barrier" to lookups is cheap and could be done before a deinit runs... but still, we'd have todo this before every deinit then? The cheapest and predictable thing I thought was "just leave nonisolated inits alone" and "don't dopy to ones where we need to spin up a task". Either way, one should not rely on the presence of locals in a deinit, and if you got them "lucky!" but don't rely on it in any way.

Nickolas_Pohilets · February 28, 2023, 7:31pm

That's a good point.

BTW, ideally when viewing stack trace for isolated deinit that hopped in the debugger, I'd like to show stack to the point where release did happen. Similar to how it works for enqueued GCD blocks.

If I understood correctly, in case of GCD, this is achieved by interaction between:

[open-sourced] MacOSX SystemRuntime plugin for LLDB - thin wrapper around libBacktraceRecording
[proprietary?] libBacktraceRecording.dylib - implements hooks that record stack trace when block is enqueued
[open-sources] libdispatch.dylib - provides API to install hooks.

Since libBacktraceRecording is proprietary, I won't be able to contribute to it, but should I at least provide some hooking API that it could use?