Jenkins jobs lose contact with AArch64 macOS machine #6774
Comments
See the examples below:
https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/17/consoleText
https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/20/consoleText
https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/22/consoleText
https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/23/consoleText
Other recent jobs ran to the end of the tests on the same machine, mac11-aarch64-08. |
fyi @AdamBrousseau |
@AdamBrousseau Is there any way for collecting more information on what happens when the "Cannot contact mac11-aarch64-08" message appears? |
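One low-effort way to gather a little more context is to scan the saved console text of each failed job for the disconnect message and note where it appears. The sketch below is only an illustration: the function name `find_disconnects` is hypothetical, and the pattern assumes the message always has the shape "Cannot contact <agent>" as quoted above.

```python
import re

# Assumption: the agent-loss message always looks like "Cannot contact <agent>",
# as in the "Cannot contact mac11-aarch64-08" text quoted in this issue.
DISCONNECT = re.compile(r"Cannot contact (\S+)")


def find_disconnects(console_text):
    """Return (line_number, agent_name) for each line reporting a lost agent."""
    hits = []
    for number, line in enumerate(console_text.splitlines(), start=1):
        match = DISCONNECT.search(line)
        if match:
            # Strip trailing punctuation that may follow the agent name.
            hits.append((number, match.group(1).rstrip(":.,")))
    return hits
```

Running it over text saved from one of the `consoleText` URLs above would show which line the job had reached when contact was lost, which may help narrow down the test that was running at that moment.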
This is a machine that is shared with the OpenJ9 Jenkins farm. We use a different username to ssh to the machine. Interesting. We had made the change to connect as I will follow up on the core files from OpenJ9 farm. |
Two failures in a row today. https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/32/
https://ci.eclipse.org/omr/job/PullRequest-osx_aarch64/33/
|
The OMR build for macOS on Apple Silicon doesn't run cleanly at the moment. @knn-k is gradually fixing the functional problems, though. I would say taking the machine offline for a few days won't have a big impact at this stage. |
It is acceptable to temporarily disable amac builds. |
Still failing frequently: 4 of the 10 most recent jobs: |
5 jobs (48, 49, 50, 53, 54) out of 8 failed with |
Somehow the SCC (shared classes cache) for the Java running the Jenkins agent is getting corrupted. I would assume it is either a problem with the JDK or something caused by the OMR testing. Not sure. I can give dev(s) access to the machine if looking at the javacore etc. files will help. |
Tried running a test by hand. It hangs, then I get booted off the machine.
Rebooting now.... |
Same thing. Could there be something in those tests that is causing an issue? |
Thank you, @AdamBrousseau. |
There are two tests in It could be a problem of macOS's tolerance to high CPU load under a certain condition. |
Attached is a simplified standalone C test program for M1 Mac with macOS 11.7.4 crashes by running this program. How can I report it to Apple? |
I opened PR #6903 for disabling the test for the time being. |
I also tried running Which version of macOS does mac11-aarch64-08 run? |
11.7.1 |
#6903 was merged, and the Jenkins job runs without the OS reboot now. |
Can this be closed now, @knn-k? |
My local test shows macOS 13 is more robust than macOS 11 against frequent calls to Let's close this issue after the PR is merged. |
#7064 has been merged. Closing. |
Other remaining issues with amac builds: |
PR #6637 added AArch64 macOS machines to the pipeline recently.
I see the Jenkins jobs on AArch64 macOS often fail with the following exception in the middle of running tests: