a long time ago in a galaxy far away, SE-0292 defined a scoped format for package identifiers, where a package id would be allowed to contain up to one dot.
the intent was likely for people to use the scope as a namespace, so you might have apple.package-a, apple.package-b, tayloraswift.package-a, etc.
this of course, wasn’t what happened. many people seem to have started using the dots artistically, and now we have packages named progress.swift, grdb.swift, etc. SPM now interprets progress as the scope and swift as the local package name, which completely negates the usefulness of the scoped identifier format.
are SE-0292 scoped package identifiers still relevant today?
It feels like a bug that we're treating URL-based packages as if they were registry identifiers? I think only if package(id:from:) and such are being used we should make that assumption.
i’m not sure if the identifiers in Package.resolved are full identifiers or just local identifiers. but based on the relevant section of SE-0292, it suggests that we should be parsing progress as the scope and swift as the local identifier.
yes, i am trying to combine docs for packages on the swiftinit index, and i was considering using the owner’s GitHub username as the scope within the package database. this implies SPM must enforce the rules for allowed package name characters defined in SE-0292:
A package's name uniquely identifies a package in a scope. A package name consists of alphanumeric characters, underscores, and hyphens. Hyphens and underscores may not occur at the beginning or end, nor consecutively within a name. The maximum length of a package name is 100 characters. A valid package scope matches the following regular expression pattern:
(i am assuming here that package scope here is a typo, since there is an analogous grammar in the paragraph above it.)
the error i got was that the documentation compiler encountered a remoteSourceControl package that already has a scope, and that is unexpected. if we automatically strip the superfluous scope, we get a package named swift, which is not a very useful package name.
I don't think the rules for a package identifiers map to the rules for GH usernames and repositories (which I think is the problem you're facing).
Also note that package identifiers aren't globally unique, they're unique per registry. So if swiftinit is a global database of sorts, using package identifiers won't necessarily work since they can be duplicated across registries.
SPM always uses the last URL path component as the package name for assigning pin identifiers. what seems to have happened is that because GitHub permits dots in repository names, and the repository name is the last URL path component in a GitHub repo dependency, that SPM (transitively) allows dots to appear in package names.
this means there exists a significant population of swift packages that have no legal name today under SE-0292, which is why i asked if SE-0292 is still relevant today, since these packages obviously aren’t going anywhere and we have to support them.
one possible, very long term, direction for the site is for it to become a package registry.
Again, SE-0292 was never meant to add restrictions onto URLs of git-based packages or local dependencies, it is only supposed to govern the scheme of identifiers as used by .package(id:from:) and similar APIs. This may or may not be 100% reflected in the text of the proposal, but that was the intent.
It depends, for the most part, we're just using the identity to disambiguate inside a particular package graph. We have CanonicalPackageLocation and CanonicalPackageURL as identifiers for situation where we need something more "global", e.g. for the shared cache of git checkouts.
i suppose that could work in the short term, but i could envision some issues in the long term with URL identities as packages migrate between organizations and people change their GitHub usernames.
Right, the way registries are supposed to handle this is that they would know a number of aliases for one logical identity.
So e.g. apple.swift-nio would also map to https://github.com/apple/swift-nio and any number of alternative URLs and identities.
This does hinge on there being some sort of administrative side of things where users can claim scopes and relevant URLs with some kind of human validation to prevent abuse.
i was afraid it was going to involve something like this…
my goal for swiftinit is to index package documentation in a way that is durable over time. this requires packages to have stable identities, so the dependency graphs don’t rot as people rename things. but implementing a package registry is very far out of the current scope of the documentation index.
anyway, thanks for the clarifications. i’m going to need to come up with a different way to persist dependency graphs that doesn’t rely on concepts of package identity.
I wonder if the GH API exposes the various old URLs for a repository? I know that they keep redirecting across renames so the information must be there at least.
the challenge is more so with the internal consistency of the package database, because i am using the package ID as SPM defines it as a primary key in the database.
swiftinit doesn’t build docs directly from source, it stores binary symbol graphs on the server-side. building docs for a package involves compiling a binary symbol graph and then linking it against the symbol graphs for its dependencies. the dependencies are loaded by looking up packages by name, as that is how the dependency tree is currently encoded within the symbol graph metadata.
Literally my first day trying to develop with swift, so still trying to figure out up from down and learning about the community, but felt like I might be able to positively contribute to the thread
I believe GH will redirect until someone else claims the previous username/reponame or blocks the creation of the old_username/reponame combination under certain conditions[1]
This was recently (2023) implemented by GH to mitigate "repojacking" vulnerabilities described in one this years DEFCON presentations[2]
Coming devops/platform engineering background and needing survive dealing with the sharp edges of python[3]/golang/ruby/terraform/cloudformation packages/registries and respective methods of dependency management, attempting to learn how Swift Packages/Namespacing worked has been one of my first steps into the swift ecosystem and how I stumbled across this thread.
Looking forward to future improvements in this area