Embedded Swift

Joe_Groff · March 28, 2024, 2:35am

We wouldn't change anything about String on existing platforms; on embedded platforms, there would just be a subset of its existing functionality. Being able to still use String and its views in embedded modes would have the benefit of existing code being easier to port. [UInt8] also isn't and probably shouldn't be ExpressibleByStringLiteral or -Interpolation, but String can still implement both of those on embedded platforms.

tera · March 28, 2024, 3:08pm

Weak specifically, or unowned as well? Unowned are easier to do, even if they are unsupported (I don't know if they are or aren't) – store the unsafeBitCast(reference, to: Int.self) integers in the dictionary and do the reverse unsafeBitCast(int, to: RefType.self) when querying the dictionary.

bbrk24 · March 28, 2024, 3:12pm

Don’t Swift objects maintain two refcounts, one for strong and one for weak/unowned? This wouldn’t touch the unowned refcount.

tera · March 28, 2024, 3:31pm

As soon as this is done – people who had those will throw away their custom String implementation and use the standard library one.

I don't have hard numbers but it feels that in the majority of cases of embedded platforms the actually used API surface is quite small: count, ==, hash, possibly <, init ascii/utf8 strings from data, convert them back to data, converting numbers to strings, and back and coding/decoding strings which are part of JSON. Alphabetically comparing strings would be probably the most complicated but those who could tolerate using ASCII-only strings would be happy using a simple < instead (which would not give correct results for unicode strings).

David_Smith · March 28, 2024, 10:11pm

Three, actually. The strong and unowned recounts are in the object, the weak refcount is outside it.

Dmitriy_Ignatyev · March 28, 2024, 10:19pm

@David_Smith Can you please share what is the max possible number of strong / weak / unowned references? I mean are they limited to UInt16 / 32 / 64?

jrose · March 28, 2024, 10:19pm

== is one of the hard ones, unfortunately! "Full" Swift ensures that "{E WITH ACUTE ACCENT}" is equal to "{E}{COMBINING ACUTE ACCENT}". So we'd need to pick one of the following:

Embedded Swift includes the subset of Unicode tables necessary for canonical equality (if you use String)
Embedded Swift defines String equality differently than "Full" Swift
Embedded Swift does not make String Equatable (which means it wouldn't be Hashable either)

David_Smith · March 28, 2024, 10:21pm

It will fall back to external storage above a platform dependent threshold, although there are cases where this hasn’t worked correctly before ( see commit history for this file swift/stdlib/public/SwiftShims/swift/shims/RefCount.h at main · apple/swift · GitHub )

John_McCall · March 28, 2024, 10:49pm

Right. Of course, we could still add those conformances to String.UTF8View (and maybe the other views) with the explicit understanding that they're based on the code unit sequence. I don't know if there are good arguments against doing that, though.

David_Smith · March 28, 2024, 11:12pm

@shantini has a PR for that: [SE-NNNN] Add Equatable and Hashable Conformance to String views by pershanti · Pull Request #1637 · apple/swift-evolution · GitHub

As far as I know we haven't landed it solely due to concerns about retroactive conformance conflicts causing compatibility breaks, it would be great to sort out how to do it.

Joe_Groff · March 28, 2024, 11:14pm

I agree, that would be useful even on desktop Swift, where oftentimes code unit comparison is all you want or need for performance or correctness needs, and it would be a nice way of portably expressing code unit comparison across desktop and embedded Swift.

tera · March 29, 2024, 12:07am

There's also a 4 for completeness:

Embedded Swift supports ASCII strings only.

I'd say #4 or #2. Option #3 would be too restrictive. Not sure about #1.

maartene · March 29, 2024, 9:05am

I’d probably get a lot of mileage out of just UInt8/ascii strings alone, so would welcome option 4.

Nevin · March 29, 2024, 2:52pm

I’m not sure if this has already been suggested, but perhaps an AsciiString type could be added to Swift so it is available on all platforms. I expect this would be generally useful for many people.

Then embedded Swift could say something like, “String is not available, try using AsciiString instead.”

carlynorama · March 29, 2024, 7:21pm

+1 thinking full Swift String capabilities aren't needed in Swift Embedded, but not 100% sure what the future holds.

As general rule if find myself trying to do any kind of real work with Strings on an embedded device that's typically a sign I've probably made an architecture mistake somewhere. But now we have a world with tiny little screens that are super cheap and everywhere. And screens seem to attract emojis like dust on a cat's whiskers.

FWIW, whole new type or mask on same-named-type is one aspect the MicroPython vs CircuitPython philosophical fork. (both excellent projects)

Swift is very different from Python, but it still might be interesting to look at some of the decisions they've made.

MicroPython or CircuitPython? | Getting Started with Raspberry Pi Pico and CircuitPython | Adafruit Learning System
MicroPython + CircuitPython - Talk Python to Me Ep.325 https://www.youtube.com/watch?v=VKBmeFb7zHY (1hr)
MicroPython - string Data Type
MicroPython differences from CPython — MicroPython latest documentation
io – input/output streams — MicroPython latest documentation
Standard Libraries — Adafruit CircuitPython 9.1.0-beta.0 documentation (anchor to omitted string functions)

CP Examples for anyone wandering into this thread whose never worked with human interface and text on a micrcontroller or to see the compromises CircuitPython made:

Emoji Keybord (puts together characters to send) CircuitPython Code Walkthrough | NeoKey Emoji Keyboard | Adafruit Learning System
Text display project Code PyPortal with CircuitPython | PyPortal Event Count-Up Clock | Adafruit Learning System

mickeyl · March 29, 2024, 8:24pm

That‘s a very good plan!

tera · March 30, 2024, 12:49am

Or:

typealias String = AsciiString

Karl · March 30, 2024, 1:10am

I wonder if that's really wise, or if it would be a bit like defining:

typealias Double = Float

or

typealias Int64 = Int16

tera · March 30, 2024, 2:22am

This would be a more reasonable parallel:

struct Int64 {
    var components: (Int32, Int32)
    // a limited subset of Int64 API follows.
}

JanWillemBrands · March 30, 2024, 9:10am

Right, anything embedded that's of any real interest will need to address world-wide markets and their genuine UI demands. ASCII is good enough for debugging and AdventOfCode :) Option #1 looks attractive.