Integer generic parameters

Joe_Groff · September 3, 2024, 3:35pm

jrose:

Seconding this, it’s been talked through before both here and in Rust (which is currently C-like, but has considered what it would take to go Swift-like with separate size and stride). As far as I know, there’s actually nothing that requires the size of [T * N] to be MemoryLayout<T>.stride * N, rather than MemoryLayout<T>.stride * (N-1) + MemoryLayout<T>.size (yes, I know that’s wrong for 0, don’t worry about it). Anything that reads or writes the entire value in a typed way will use the correct size, whatever we decide that is; anything that only goes element-by-element will never read any of the tail padding at all. The only problem comes if someone reads or writes the “full” N * stride when only the unpadded length was allocated. And that can certainly happen! But it implies someone is manually calculating sizes that they hopefully don’t need to anymore. (Unfortunately, if they’re doing it at all, it’s already in unsafe-pointer land, so there’s not an obvious way for the compiler to catch it.)

There is a performance cost, just as there is with any size vs stride difference: care must be taken to preserve the elements in the tail padding, which may mean more instructions to avoid touching extra memory that doesn’t belong to you. But that isn’t really new in Swift, though it should probably go into a performance guide for fixed-sized arrays.

In Swift's case, there's already a decent amount of code out there using UnsafeBufferPointer for bulk data operations, and the underlying array value witness operations in the runtime that they use assume that an "array" is stride-padded. We want new contiguous collection types to be compatible with existing practice, and Span to be a mostly drop-in replacement for unsafe types, so IMO we should stride-pad any type that's used for array storage.

rvsrvs · September 26, 2024, 3:26pm

Was playing with this over the past week using the ValueGenerics experimental feature. I discovered that almost the first thing I wanted to do was make use of integer parameters variadically, but that this was clearly denoted as a future direction rather than as part of the overall design.

As a (future) review comment, it seems odd to introduce this feature and special case treating these parameters unlike all others. It would seem much more consistent to allow me to specify an integer parameter in any context where I specify a non-integer param.

bbrk24 · September 26, 2024, 4:54pm

The only case I can say I’ve encountered in my code is initializing a UUID from a bunch of integers. Which begs the question: even if we can do _vector for struct properties, how will functions be handled?

rvsrvs · September 26, 2024, 5:51pm

i'm curious could you elaborate on what you are thinking in re funcs?

Datagram · September 29, 2024, 11:53am

I'm very excited to discover this pitch and see the progress towards Swift's native support for fixed-size arrays.

I was starting to despair that fixed-size array support would ever come.

And a big yes the sooner we have the possibility of importing fixed-size array from C, the better it will be even if this causes a break in source compatibility given the pain and limits of the current import in the form of tuples.

scanon · September 29, 2024, 6:22pm

FWIW, I'm cautiously optimistic that we can adopt eventual fixed-size array importing without breaking source compatibility.

Pippin · October 10, 2024, 9:08pm

Why couldn't we have the disambiguation be determined by Self? We already have common methods for disambiguation with self and these parameters are tied to the type. Having just read the Generalized Collection APIs section for the Vector pitch, it looked completely off to me. Now, the problems to solve is whether the self or Self usage gets inferred to omit the prefix and how clashes would be used on instances of the type... maybe living within the meta type Type /think

What we all know and use


struct Foo {
    
    let bar: Int
    
    func doStuffWithBar() {
        var bar = self.bar
        // stuff
    }
}

Vector with count and Collection.count:

struct Vector<let count: Int, T> { /* ... */ }

extension Vector where Element: ~Copyable {
    // ....

    public var count: Int { Self.count }
}

Lucca-mito · October 10, 2024, 10:04pm

Unfortunately, we can’t refer to a type’s generic parameters using Self. since they’re not actually static properties of the type. So, in your example, Self.count does not exist. To prove this using current Swift, the code below:

struct S<T> {
    static func foo() {
        print(Self.T.self)
    }
}

S<Int>.foo()

Fails to compile with the following message: type 'S' has no member 'T'. (Of course, it would work if we wrote T.self instead of Self.T.self, but then we're back where we started in terms of disambiguating Vector's count.)

So, to use the lowerCamelCase convention on Vector’s integer generic parameter, I believe we’d have to call the generic parameter something else like size, length, capacity, fixedCount, etc.

tera · October 10, 2024, 10:11pm

Well, this is allowed:

struct Vector<count> {
    var count: Int { MemoryLayout<count>.size }
}

which suggest that this should also be allowed:

struct Vector<let count: Int> {
    var count: Int { count }
}

no?

Pippin · October 10, 2024, 10:52pm

This is the first proposal where a value is a property of a type, it seems like they should be. At the same time, I think of this as a way to just combat the potential ambiguation we will face.

I haven't looked throughout the forums, but almost feel like there should be a way to expose generic parameter types like the example syntax. (Edit: er, pre-some syntax, maybe not anymore)

fan · October 15, 2024, 6:32pm

It's great to see this finally proposed after four and a half years

georgeef · December 26, 2024, 9:18pm

I would rather write the first example in this proposal as:

struct Vector<N, T> where N: SingleInt {
  /*implementation TBD*/
}

in which N is a (singleton) type, which happens to contain only one value, and this value happens to be (interpreted as) an Int.

What is the need for the special syntax let N: Int instead of just N and a conformance to a protocol? I don't know the implementation details, maybe I miss something.

A few comments on the above syntax:

N is a type, not a value, so it is capitalized.
let n: N is a valid declaration. No need for an assignment, because n can take only one value (and it can be eliminated in optimization).
var n: N? is a valid declaration, which assigns the default value nil.
The single integer value in N can be expressed with something like N.value (similar to Int.max).

Regarding parameter arithmetic: this is something special to this kind of "types", and maybe the most important feature. The expression N + M, where N, M: SingleInt can be considered as type arithmetic as well. The result is another type, also conforming to SingleInt (assuming there is no overflow).

Regarding future directions: there is UInt for non-negative integers, but little support for other common integer ranges. It would be nice to have something like 1+UInt for positive (and non-zero) integers. Also the trivial singleton type ZInt which contains only 0.

With some (limited) type arithmetic, a few common types could be expressed:

n + ZInt is the type containing only n
n + UInt is the type of integers greater than or equal to n
n - UInt is the type of integers less than or equal to n
2 * UInt + 1 is the type of positive odd integers

The type 1+UInt of positive integers is not the same as the range 1... of positive integers. The first is a type, while the latter is a value. A value of type 1+UInt is a single integer, not a range.