Character Comparison Doesn't Follow Unicode Scalar Value?

UniverseObserver · February 4, 2026, 6:50pm

I stumble upon comparing two characters "ὰ" (\u{1F70}) and "ώ" (\u{1F7D}). I was expecting that "ὰ" < "ώ" but swift thinks the opposite. Why is this the case?

let a: Character = "\u{1F70}"  // ὰ
let b: Character = "\u{1F7D}"  // ώ

print(a)
print(b)

print(a < b)  // false

let aScalar = a.unicodeScalars.first!.value
let bScalar = b.unicodeScalars.first!.value
print(aScalar < bScalar) // true

The output is

ὰ
ώ
false
true

ksluder · February 4, 2026, 6:54pm

Characters and strings follow the rules of the Unicode Collation Algorithm. Along other things, the actual result is locale-dependent; do you get the same results if you set your locale to Greek?

xwu · February 4, 2026, 6:56pm

No, Swift standard library operations are locale-independent. Locale-aware operations are vended by Foundation.

UniverseObserver · February 4, 2026, 6:57pm

Thanks! I thought Character is treated differently but it looks like it's just a wrapper around String. In my case I'm only interested in comparing the Characters based on their unicode values.

UniverseObserver · February 4, 2026, 6:58pm

Thanks for the clarification!

xwu · February 4, 2026, 7:03pm

Ah, then UTS#10 linked to above by @ksluder has an important sentence for you to read:

The basic principle to remember is: The position of characters in the Unicode code charts does not specify their sort order.

I guess it must be a common enough misconception that they thought fit to write this out in bold and italics.

YOCKOW · February 5, 2026, 3:35am

I think Swift also doesn't guarantee the sort order of Characters.
In fact, Swift changed its strategy to compare Characters in Swift 4.2.

 let character_u = Character("u") // U+0075
 let character_v = Character("v") // U+0076
 let character_uDiaeresisMacron = Character("\u{01D6}") // ǖ
 print(character_u < character_uDiaeresisMacron) // Prints "true"
 print(character_v < character_uDiaeresisMacron) // Prints "true" in Swift>=4.2, otherwise "false".

xwu · February 5, 2026, 3:45pm

There is an intended sort order (of sorts). The change in Swift 4.2 would have been a bugfix:

github.com/swiftlang/swift

[SR-530] [String] sort order varies on Darwin vs. Linux

opened 07:57PM - 12 Jan 16 UTC

closed 09:34PM - 02 May 18 UTC

mxcl

bug standard library

| | | |------------------|-----------------|… |Previous ID | SR-530 | |Radar | None | |Original Reporter | @mxcl | |Type | Bug | |Status | Closed | |Resolution | Done | <details> <summary>Additional Detail from JIRA</summary> | | | |------------------|-----------------| |Votes | 0 | |Component/s | Standard Library | |Labels | Bug | |Assignee | Lance (JIRA) | |Priority | Medium | md5: 39f03ef4603f0daa8e6d26ebcc5f0250 </details> **Issue Description:** When sorting: ``` java ["app", "deck-of-playing-cards", "FisherYates", "PlayingCard"].sort() ``` I get: [0] = "FisherYates" [1] = "PlayingCard" [2] = "app" [3] = "deck-of-playing-cards" But on Linux I get: [0] = "app" [1] = "deck-of-playing-cards" [2] = "FisherYates" [3] = "PlayingCard" This is with the snapshot \`2016-01-06\`. I'm installing 2016-01-11 now.

sspringer · February 5, 2026, 3:52pm

In addition to the other answer: note that a character can consist of several Unicode scalars.

UniverseObserver · February 5, 2026, 3:55pm

yeah I bumped into this a few minutes after posting this. let c: Character = "\r\n" is considered as a Character