Concurrency 101

Good to know, thanks.

1 Like

Sure, everyone should use correct terminology, even those of us who are mere mortals and not concurrency experts. Thanks for pointing that out.

There is no doubt that Swift concurrency is great and all, but to be honest I'm still not confident about using it. When I create a thread, I know what I'm doing. When I use async/await, I... well, not quite. This thing is cooperative, but creates an illusion of parallelism. I even initially (and mistakenly) thought that it was purely thread based.

Maybe I have lived under a rock, as Swift concurrency has been around for several years already, and only now have I started to evaluate it. Maybe that's because the majority of my programming work is not Swift related and I had no time to learn new things in Swift.

And, speaking of new things, I'm waiting for Swift 6, not without a bit of fear :face_with_peeking_eye:

It is not entirely cooperative; it's mixed. There are real threads, normally as many as there are CPU cores, and there's cooperative switching too. This is similar to many modern server architectures such as nginx's (and I think nginx was one of the pioneers of this architecture in the mainstream).

Basically Swift's structured concurrency frees you from thinking about when to use cooperative vs. real switching; it does it for you automatically. But as with any software abstraction, you won't be able to fully benefit from it without knowing how it works under the hood, and it's too easy to abuse without knowing the inner mechanisms. The general principle is that Swift knows when things should be serialized and executes them cooperatively (e.g. within the same actor/isolation), whereas in all other cases it will try to use true parallelism. That's it in a few words.
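To make that principle concrete, here is a minimal sketch of my own (names are mine, not from the posts above): calls isolated to the same actor are serialized cooperatively, while the thousand child tasks themselves are free to run in parallel on the pool.

```swift
// Calls into the same actor are serialized (cooperative, no data races),
// while the independent child tasks may run in parallel on the thread pool.
actor Counter {
    private var value = 0
    func increment() -> Int {
        value += 1
        return value
    }
}

let counter = Counter()

// 1000 concurrent child tasks; the actor serializes every increment.
await withTaskGroup(of: Void.self) { group in
    for _ in 0..<1000 {
        group.addTask { _ = await counter.increment() }
    }
}

print(await counter.increment())  // prints 1001: no lost updates
```

Replacing the actor with a plain class and a `var` would be a data race; the actor is what turns "should be serialized" into cooperative execution.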

3 Likes

Sure. I oversimplified things for brevity.

Here are some interesting posts: 1 by @aetherealtech and 2 by @David_Smith.

Here is another one, by @Karl, which talks about microcontrollers.

You might find it enlightening, especially if you are not familiar with how code runs deep down on the hardware.

Here is another one, by @layoutSubviews.

Here is more stuff from Swift concurrency: Behind the scenes.

With Swift, we want to change the execution model of apps from the following model, which has lots of threads and context switches, to this. Here you see that we have just two threads executing on our two-core system and there are no thread context switches. All of our blocked threads go away and instead we have a lightweight object known as a continuation to track resumption of work. When threads execute work under Swift concurrency they switch between continuations instead of performing a full thread context switch. This means that we now only pay the cost of a function call instead.

So the runtime behavior that we want for Swift concurrency is to create only as many threads as there are CPU cores, and for threads to be able to cheaply and efficiently switch between work items when they are blocked. We want you to be able to write straight-line code that is easy to reason about and also gives you safe, controlled concurrency. In order to achieve this behavior that we are after, the operating system needs a runtime contract that threads will not block, and that is only possible if the language is able to provide us with that. Swift's concurrency model and the semantics around it have therefore been designed with this goal in mind.
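The continuation the transcript describes is also surfaced directly in the API: `withCheckedContinuation` wraps a callback so the awaiting task suspends cheaply instead of blocking a thread. A small sketch, where `legacyFetch` is a hypothetical callback-style API of my own:

```swift
import Foundation

// A hypothetical legacy callback-style API.
func legacyFetch(completion: @escaping (Int) -> Void) {
    DispatchQueue.global().asyncAfter(deadline: .now() + 0.1) {
        completion(42)
    }
}

// Wrapping it: the task suspends here via a continuation
// rather than blocking a thread while the callback is pending.
func fetch() async -> Int {
    await withCheckedContinuation { continuation in
        legacyFetch { value in
            continuation.resume(returning: value)
        }
    }
}

print(await fetch())  // prints 42
```

The "checked" variant traps if the continuation is resumed twice or leaked, which is exactly the kind of misuse that is otherwise hard to debug.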

This is quite interesting if you are curious about the world under the hood.

I have started to compile a partial list of evolution proposals related to concurrency.

SE-0176 - Enforce Exclusive Access to Memory (Law of Exclusivity)

SE-0317 - async let bindings

SE-0414 - Region based Isolation

SE-0430 - Sending parameter and result values

SE-0431 - @isolated(any) Function Types

SE-0433 - Synchronous Mutual Exclusion Lock
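As a taste of one of these, SE-0317's `async let` starts child tasks concurrently as soon as they are declared and only suspends when their results are awaited. A minimal sketch (function names are mine):

```swift
func square(_ x: Int) async -> Int { x * x }

func sumOfSquares() async -> Int {
    // Both child tasks start immediately and run concurrently;
    // the awaits below only suspend until each result is ready.
    async let a = square(3)
    async let b = square(4)
    return await a + b
}

print(await sumOfSquares())  // prints 25
```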

Is this accurate though? How can it be just a "cost of a function call" if you at least need to dequeue task information from a thread-safe queue before calling a function (and passing the closure context too)?

I think they mean it for async functions running in a given task when those functions suspend and resume.

I hope that someone from the Darwin Runtime team clarifies this.

Unfortunately, the official documentation for concurrency comes in the form of videos and their transcripts. :frowning_face:

Quoting from How to avoid cascading async functions?

@j-f1 explains nicely why the above attempt is an anti-pattern.

But, is there really no practical solution for this that can be used when the sync world meets the async world in real life?
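For context, the commonly attempted bridge looks something like the following (my own sketch, not the quoted attempt): block the calling thread on a semaphore until a task signals it. It happens to work here because the blocked thread is not part of the cooperative pool, but it deadlocks whenever the blocked thread is one the awaited work needs, which is why it is called out as an anti-pattern.

```swift
import Foundation

// Anti-pattern sketch: synchronously waiting for async work.
// This only "works" because the blocked thread is not one of the
// cooperative pool's threads; do not do this inside Swift concurrency.
let semaphore = DispatchSemaphore(value: 0)

Task.detached {
    try? await Task.sleep(nanoseconds: 100_000_000)
    print("async work done")
    semaphore.signal()
}

semaphore.wait()   // blocks the current thread until signalled
print("sync code continues")
```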

Update: There is a safe solution

Yes, there is no correct way to synchronously wait for the result of an async computation. It turns out that this was also true before Swift Concurrency, but the symptoms of the incorrectness were somewhat mitigated, so instead of "it very likely fails" it was "occasionally, in weird edge cases, it will fail, and usually it will just be less efficient".

Which is preferable is a topic of considerable debate: is it best to make it clear something is wrong early, allowing the developer to notice and fix it, or is it best to try to work around it, risking the failure shipping to customers but perhaps never being an issue in practice? I could probably make a half dozen reasonable arguments for either side, although my personal preference is the stricter approach.

6 Likes

I think this statement is overly broad. I am guessing the “weird edge case” refers to a higher-priority thread blocking on a lower-priority thread, and that thread being such low priority that it never gets a chance to run.

Some programs control all the threads in their process and can therefore ensure the blocking thread runs by being the only runnable thread.

But even in more realistic cases, an app can opt some threads into a scheduler that guarantees all threads will run by calling pthread_setschedparam(SCHED_RR). Lots of games do this for their render and work threads, and we even recommended it at WWDC a few years ago.

And there’s no technical reason the normal Darwin scheduler could not implement priority inversion avoidance for dynamically blocked threads. It just hasn’t yet.

2 Likes

The more I think about the ability to make async code sync, the more it feels wrong for the whole concurrency model (at least in Swift): while the story is all about non-blocking solutions, it would allow blocking behavior to be introduced into systems, including through APIs that aren't meant to block. I think some solution is possible in technical terms, but should this solution exist in the first place?

That, but also hitting the thread limit. Whether that’s the default dispatch pool (64ish), the overcommit pool (512), or xnu’s “something is wrong, kill -9 the process” limit (I think it was 1024 about 20 years ago, haven’t looked lately).

Connecting the sync world with the async world.

@Gero posted an example in this thread, which does away with the need to use semaphores when connecting the sync world with the async world.

I have adapted the following example from it to demonstrate its usefulness.

import Foundation

@main
enum sync_async {
    static func main () throws {
        let sa = SyncAsyncAdaptor ()
        sa.enqueue (f)
        sa.enqueue (g)
        sa.enqueue (h)
        
        #if false
        Thread.sleep (until: .now + 19)
        #endif
        
        dispatchMain()
    }
}

func f () async {
    print (#function, "begin...")
    try! await Task.sleep (until: .now + .seconds(7))
    DispatchQueue.main.async {
        print (#function, 1)
    }
    print (#function, "end.")
}

func g () async {
    print (#function, "begin...")
    try! await Task.sleep (until: .now + .seconds(10))
    DispatchQueue.main.async {
        print (#function, 2)
    }
    print (#function, "end.")
}

func h () async {
    DispatchQueue.main.async {
        // cause the program to exit
        print (#function, "exit...")
        exit (0)
    }
}

final class SyncAsyncAdaptor {
    private let continuation: AsyncStream<() async -> Void>.Continuation

    init() {
        let (stream, cont) = AsyncStream<() async -> Void>.makeStream()
        continuation = cont
        Task.detached {
            for await workItem in stream {
                await workItem ()
            }
        }
    }

    deinit {
        continuation.finish()
    }

    func enqueue (_ workItem: @escaping () async -> Void) {
        continuation.yield (workItem)
    }
}

Produces the output:

f() begin...
f() end.
g() begin...
f() 1
g() end.
g() 2
h() exit...
Program ended with exit code: 0
1 Like

Yes, blocking on anything from a concurrent dispatch queue can cause thread explosion. I don’t think you can hit the hard thread limit that way, though. You’d have to be writing code that effectively reimplements concurrent queues with overcommit, at which point you really ought to know what you’re doing :)

You'd think, but serial queues default to overcommit, so it's easy-ish to do this if you do the "each object has a serial queue" model that was popular for a while (and don't retarget them to a non-overcommit queue).
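A sketch of the retargeting mitigation mentioned here (queue labels are mine): give each object its own serial queue, but retarget them all at one shared concurrent queue, which is non-overcommit by default, so blocked work stops forcing new threads into existence.

```swift
import Foundation

// One shared concurrent queue; custom concurrent queues are
// non-overcommit by default, unlike standalone serial queues.
let pool = DispatchQueue(label: "app.pool", attributes: .concurrent)

// Per-object serial queues retargeted at the shared pool, instead of
// standalone serial queues (which default to overcommit).
let objectQueue1 = DispatchQueue(label: "app.object1", target: pool)
let objectQueue2 = DispatchQueue(label: "app.object2", target: pool)

objectQueue1.sync { print("object 1 work") }
objectQueue2.sync { print("object 2 work") }
```

Each object's queue still serializes that object's work; only the underlying thread supply is shared.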

1 Like

After my naive attempt at defining an array of async let bindings:

let fv: [() async -> Int]  = [f, g, h]
var uv: [async Int] = [] // <--- not possible

for f in fv {
    async let u = f ()
    uv.append (u)
}

for u in uv {
    await print (u)
}

I asked the question "Will array of async values be possible?" here.

The answer was not a definite no.

However, @crontab has a utility (Zip) in his personal library, which provides a good workable solution.

It uses a task group underneath, which hides the nesting of the task groups and results in clean-looking code.

@main
enum AsyncZip {
    static func main () async throws {
        @Sendable func n () -> Int {
            let v = [2, 3, 5, 7, 11, 13, 17, 19, 23, 29, 31, 37]
            return v [Int.random (in: 0..<v.count)]
        }

        @Sendable func f () async throws -> Int {n () + 1}

        let u = Zip (actions: [f, f, f])
        try await print (u.result)
        
        @Sendable func p () async throws -> [Int] {
            let u = Zip (actions: [f, f, f, f, f])
            return try await u.result
        }
        
        var v = Zip <[Int]> ()
        v.add (p)
        v.add (p)
        v.add (p)
        try await print (v.result)
    }
}

Possible output:

[18, 30, 30]
[[3, 32, 38, 24, 8], [8, 3, 30, 12, 8], [38, 6, 24, 3, 38]]
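For comparison, the fan-out that a utility like Zip presumably wraps can also be written directly with a task group; results are tagged with their index so the output order stays deterministic. This is my own sketch, not @crontab's implementation:

```swift
// Run an array of async actions concurrently; collect results in input order.
func zipped<T: Sendable>(_ actions: [@Sendable () async throws -> T]) async throws -> [T] {
    try await withThrowingTaskGroup(of: (Int, T).self) { group in
        for (index, action) in actions.enumerated() {
            group.addTask { (index, try await action()) }
        }
        // Results arrive in completion order; place them by index.
        var results = [T?](repeating: nil, count: actions.count)
        for try await (index, value) in group {
            results[index] = value
        }
        return results.compactMap { $0 }
    }
}

func f() async throws -> Int { 7 }

let values = try await zipped([f, f, f])
print(values)  // prints [7, 7, 7]
```

The index tagging is the part that the nested-task-group approach makes tedious by hand, which is exactly what a small utility earns its keep for.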