Graph structure made out of value types

I am playing with a tiny hand-made neural network and keep wondering whether I could use value types instead of classes in it. At the moment it is a graph of reference-type nodes, each a wrapper around a Double with a few extra bits. The nodes are mutable, but the graph structure itself, once created, is constant.
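Roughly, the current node looks something like this (just a sketch; the grad field and the inputs name are illustrative, not the exact code):

final class Value {
    var data: Double
    var grad: Double = 0        // one of the "few extra bits"
    let inputs: [Value]         // edges are plain object references, fixed after creation

    init(_ data: Double, inputs: [Value] = []) {
        self.data = data
        self.inputs = inputs
    }
}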

I can mechanically change class Value to:

struct Value {
    var id: Int
    var data: Double {
        get { allValues[id].data }
        nonmutating set { allValues[id].data = newValue }
    }
    ...
}

struct ValueContent {
    var data: Double
    ... a few other fields
}

var allValues: [ValueContent] = [ ] // values are allocated from here

But this feels like I've just reinvented reference types.

Is there a good example of a neural network (or just a graph implementation) made out of value types?
Or are value types just not a good fit for this task and I should keep using classes?


How do you address the nodes? It seems impossible to me, without references, but I'm probably wrong.

Here's an example of me doing something similar, switching from reference to value type after initialization.

I can do this via "paths":

func value(at: [Int]) -> Value
func setValue(_ value: Value, at: [Int])

root.setValue(..., at: [0, 1, 1, 0, 0, 1])
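For a tree-shaped graph of value-type nodes, those accessors could be implemented roughly like this (a sketch that simplifies the payload to a plain Double; the Node type and its children array are assumptions):

struct Node {
    var value: Double
    var children: [Node] = []

    // Walk down the tree following the child indices in `path`.
    func value(at path: [Int]) -> Double {
        guard let first = path.first else { return value }
        return children[first].value(at: Array(path.dropFirst()))
    }

    // Write along the same path; copy-on-write keeps this a pure value-type tree.
    mutating func setValue(_ newValue: Double, at path: [Int]) {
        guard let first = path.first else { value = newValue; return }
        children[first].setValue(newValue, at: Array(path.dropFirst()))
    }
}

var root = Node(value: 0, children: [Node(value: 1), Node(value: 2, children: [Node(value: 3)])])
root.setValue(42, at: [1, 0])

Note that with pure value semantics a node reachable via two different paths becomes two independent copies, which matters if nodes are meant to be shared between parents.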

With the "allValues" approach I can change the values via that "external" array:

for i in 0 ..< allValues.count {
    if allValues[i].type == ... {
        allValues[i].data += ...
    }
}

Without allValues: since the graph I am dealing with is acyclic, it is easy to traverse it starting from the root. So to set a value I could do:

root.traverse { node in
    if node.type == ... {
        node.data = 0
    }
}

Incrementing the value of each node is a bit more involved, as a node can be encountered more than once in the graph:

root.traverse { node in
    if node.type == ... {
        node.isVisited = false
    }
}

root.traverse { node in
    if node.type == ... && !node.isVisited {
        node.isVisited = true
        node.data += ...
    }
}
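Here traverse itself can be a plain recursive walk over each node's inputs (a sketch, assuming the class-based Value node keeps its edges in an inputs array); because a node can feed into several other nodes, the recursion may reach it more than once, which is exactly what the isVisited flag above compensates for:

extension Value {
    func traverse(_ visit: (Value) -> Void) {
        visit(self)
        for input in inputs {
            input.traverse(visit)
        }
    }
}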

Just in case, did you read about adjacency arrays (or adjacency matrices)? I haven't used them myself, but they are often mentioned as a way to implement a graph using value types. Tera's path approach is good for his specific problem domain, but it isn't as general as an adjacency array. I use UUIDs in my app (it's a financial app and the data items reference each other; the relation is an ad-hoc graph). It may sound awkward to use value types in this case, but I somehow worked out a framework specific to my app and it worked well.
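To make the adjacency-array idea concrete, here is a minimal sketch of a graph built purely from value types: all nodes live in a single array, and edges are just indices into that array (all names are illustrative):

struct Graph {
    struct Node {
        var data: Double
        var neighbours: [Int] = []   // outgoing edges, as indices into `nodes`
    }

    var nodes: [Node] = []

    mutating func addNode(_ data: Double) -> Int {
        nodes.append(Node(data: data))
        return nodes.count - 1       // the index doubles as the node's id
    }

    mutating func addEdge(from: Int, to: Int) {
        nodes[from].neighbours.append(to)
    }
}

var graph = Graph()
let a = graph.addNode(1.0)
let b = graph.addNode(2.0)
graph.addEdge(from: a, to: b)

Addressing nodes by plain Int indices keeps the whole graph a single value, at the cost of managing the indices yourself (essentially the same trade-off as the allValues approach above).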

One common issue in value-type programming is how to access (and modify) the values. What I learned from my experience is that it usually requires a framework suited to your problem domain. This might not be obvious to people coming from OOP, because it's a non-issue with reference types.

Take SwiftUI as an example: Apple's engineers introduced a lot of SwiftUI-specific mechanisms to access and modify values, like @State and @Environment. We need to work out our own mechanisms to address similar issues when using value types. Our approaches might not be as elegant as Apple's, but the point is that this is a common issue one needs to address in value-type programming.

I tried a few options with classes and values and got these timing results:

Ref: 92ms
Val + Array: 617ms
Val + malloc: 96ms

In this test a graph of about 2M nodes is traversed a few times, during which some nodes are changed based on other nodes. A node is a POD struct of 80 bytes (or a class holding that struct).

  • "Ref" - a version that uses classes
  • "Val + Array" - a version that uses a minimal struct with integer ID and the data itself resides in a side "allValues" array.
  • "Val + Array" - ditto but this time the data resides in a malloced buffer

The overhead of array subscript operations is quite high (the measured section didn't contain array insertions or deletions, only getting and setting array elements; FWIW, there were 836 million getter and 145 million setter calls during this test, in addition to some 200 million math operations).

Interestingly, the class version is still a bit faster than the Val + malloc version, despite the anticipated ARC overhead.

As a side note: a deterministic random number generator helped tremendously during testing / refactoring.
  • The algorithm depends on random numbers, and to get consistent results across app runs (e.g. to check that bugs were not introduced during refactoring or in a different version of the algorithm), having a deterministic / seedable random number generator is super important. For that I'm using drand48.
  • The app could have benefited from the "shuffle" call, but that's based on Swift's random number generator, which doesn't give a deterministic sequence, so I had to reimplement shuffle myself (see the sketch below).
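A seedable generator can also be plugged into the standard shuffle(using:) overload; here is a minimal sketch using SplitMix64 (just one possible choice). Note that the algorithm behind shuffle(using:) isn't documented as stable across Swift releases, so results are only reproducible within a given toolchain, which may still be a reason to roll your own:

struct SplitMix64: RandomNumberGenerator {
    private var state: UInt64
    init(seed: UInt64) { state = seed }

    mutating func next() -> UInt64 {
        state &+= 0x9E3779B97F4A7C15
        var z = state
        z = (z ^ (z >> 30)) &* 0xBF58476D1CE4E5B9
        z = (z ^ (z >> 27)) &* 0x94D049BB133111EB
        return z ^ (z >> 31)
    }
}

var rng = SplitMix64(seed: 42)
var indices = Array(0 ..< 10)
indices.shuffle(using: &rng)   // deterministic for a given seed and toolchain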

Can you post your source code somewhere? There are a few things that could make that Array number higher than it should be:

  • if the Array is a global, including in script mode, necessitating run-time enforcement of exclusivity
  • if the elements are set while there's still another reference to the Array, forcing a copy (a small illustration of this follows below)
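For example, the copy case looks like this in isolation (a tiny sketch, independent of the code in question):

var values = [Double](repeating: 0, count: 1_000_000)
let snapshot = values    // second reference to the same storage
values[0] = 1            // storage is no longer uniquely referenced, so the whole buffer is copied first
_ = snapshot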

Your suggestion about run-time exclusivity checks is spot on: if I disable them, Array subscripts are significantly faster (still slower than unsafe pointer subscripts, but that's probably to be expected, no?).

This is a much simpler version that models what the app is doing. It shows about the same speed difference between the Array and malloc versions (actually a bit higher, because it does so little work per array / buffer access compared to the whole app):

// Release build, exclusive access to memory checks: on
// 3263235 µs -  Array version
// 417975 µs - malloc version
// 7.8 times slower

// Release build, exclusive access to memory checks: off
// 972422 µs - Array version
// 401312 µs - malloc version
// 2.4 times slower
import Foundation

let n = 1 << 21 // ~ 2 million
let mask = (1 << 21) - 1
var array = [Double](repeating: 0, count: n)
var buffer = malloc(n * MemoryLayout<Double>.stride).assumingMemoryBound(to: Double.self)

func initBuffers() {
    for i in 0 ..< n {
        array[i] = Double(i)
        buffer[i] = Double(i)
    }
}

struct Value2 {
    subscript(array index: Int) -> Double {
        get { array[index] }
        nonmutating set { array[index] = newValue }
    }
    subscript(buffer index: Int) -> Double {
        get { buffer[index] }
        nonmutating set { buffer[index] = newValue }
    }
}

class Test {
    
    init() {
        initBuffers()
    }
    
    func test() {
        let mask = n - 1
        var sum = 0.0
        let v = Value2()
        
        var k = 0
        let start1 = Date()
        for _ in 0 ..< 100 {
            for i in 0 ..< n {
                let index = (i * k) & mask
                let val = v[array: index]
                v[array: index] = val * 1.0001
                k += 1
                sum += val
            }
        }
        let elapsed1 = Date().timeIntervalSince(start1)
        
        let start2 = Date()
        for _ in 0 ..< 100 {
            for i in 0 ..< n {
                let index = (i * k) & mask
                let val = v[buffer: index]
                v[buffer: index] = val * 1.0001
                k += 1
                sum += val
            }
        }
        let elapsed2 = Date().timeIntervalSince(start2)
        
        print(sum)
        print("\(Int(elapsed1 * 1000_000)) µs")
        print("\(Int(elapsed2 * 1000_000)) µs")
        print("\(elapsed1/elapsed2) times slower")
        print()
    }
}

let test = Test()
test.test()
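As a follow-up: instead of disabling exclusivity checking globally, the per-element cost can also be hoisted out of the hot loop with withUnsafeMutableBufferPointer, which performs a single access to the global Array for the whole closure. Here is a sketch of the Array half of the loop above under that approach, reusing the globals from the snippet (not measured here):

func testArrayViaBufferPointer() -> Double {
    var sum = 0.0
    var k = 0
    array.withUnsafeMutableBufferPointer { elements in
        for _ in 0 ..< 100 {
            for i in 0 ..< n {
                let index = (i * k) & (n - 1)
                let val = elements[index]       // no per-subscript exclusivity check on the global
                elements[index] = val * 1.0001
                k += 1
                sum += val
            }
        }
    }
    return sum
}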