I am working on a PR to add Linux support to opentelemetry-swift. I have everything working locally in an Ubuntu VM and am trying to add a GitHub Actions job to run the tests on ubuntu-latest.
Unfortunately, the Linux tests seem to stall after some number of tests have run. At first I was using the swift:5.10 Docker image, so I tried switching to swiftly to install the toolchain and run the tests, in case the Docker image was somehow causing the problem (even though the image also worked locally). I also tried removing the overrides of some XCTest methods in case those were breaking things, but that has not made any difference either.
I let one of the jobs time out yesterday. Interestingly, the raw logs indicate that nothing was printed from the moment the job appeared to get stuck until the job was cancelled, at which point some XCTest output was printed. That seems somewhat suspicious, but I'm not really sure what to do with the information. raw logs - actions summary
So far I've only seen it stall after AggregationsTests.testDropAggregation passes, or somewhere in the Base2ExponentialHistogramAggregationTests test case, which seems very odd. Neither of those test cases is doing anything remotely interesting, though.
The tests aren't being run in parallel, so there really shouldn't be significant differences between running them locally and running them in GitHub Actions. Obviously something is very consistently breaking in one environment and not the other, though.
Any thoughts on what might be happening here?